Web Content Analysis: Expanding the Paradigm


Are established methods of content analysis (CA) adequate to analyze web content, or should new methods be devised to address new technological developments? This article addresses this question by contrasting narrow and broad interpretations of the concept of web content analysis. The utility of a broad interpretation that subsumes the narrow one is then illustrated with reference to research on weblogs (blogs), a popular web format in which features of HTML documents and interactive computer-mediated communication converge. The article concludes by proposing an expanded Web Content Analysis (WebCA) paradigm in which insights from paradigms such as discourse analysis and social network analysis are operationalized and implemented within a general content analytic framework.



Copyright information

© Springer Science+Business Media B.V. 2009

Authors and Affiliations

  1. School of Library and Information Science, Bloomington, USA
