Abstract
Online communities are creating a growing legacy of texts in online bulletin board postings, chat, blogs, etc. These texts record conversation, knowledge exchange, and variation in focus as groups grow, mature, and decline; they represent a rich history of group interaction and an opportunity to explore the purpose and development of online communities. However, the quantity of data created by these communities is vast, and to address their processes in a timely manner requires automated processes. This raises questions about how to conduct automated analyses, and what can we gain from them: Can we gain an idea of community interests, priorities, and operation from automated examinations of texts of postings and patterns of posting behavior? Can we mine stored texts to discover patterns of language and interaction that characterize a community?
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Barrett, K. LaPointe, D. & Greysen, K. (Jan. 2004). Speak2Me: Using synchronous audio for ESL teaching in Taiwan. Report R28/0401, Athabasca University, Centre For Distance Education.
Boguraev, B. & Kenned, C. (1999). “Applications of term identification technology: domain description and content characterization”, Natural Language Engineering 5(1): 17–14.
Boguraev, B., Wong, Y. Y., Kennedy, C, Bellamy, R., Brawer, S., and Swartz, J. (1998). Dynamic presentation of document content for rapid on-line browsing. AAAI Spring Symposium on Intelligent Text Summarization, Stanford, CA. 118–128.
Brants, T. (2000). “TnT: A statistical part-of-speech tagger”, in Proceedings of the 6th Conference on Applied Natural Language Processing (Seattle, WA), pp. 224–231.
Cherny, L. (1999). Conversation and community: Chat in a virtual world. Stanford, CA: CSLI Publications.
Crystal, D. (2001). Language and the Internet. Cambridge, UK: Cambridge University Press.
DeSanctis, G. & Poole, M. S. (1994). “Capturing the complexity in advanced technology use: Adaptive structuration theory”, Org. Science, 5(2), 121–47.
Dönmez, P., Rosé, C, Stegmann, K., Weinberger, A. & Fischer, F. (2005). “Supporting CSCL with automatic corpus analysis technology”, CSCL V5: Proceedings of Th 2005 Conference on Computer Support for Collaborative Learning, Taipei, Taiwan. 125–134.
Erickson, T. Herring, S. & Sack, W. (2002). Discourse Architectures: Designing and Visualizing Computer-Mediated Communication. Workshop at the CHI 2002 Conference, Minneapolis, MN.
Fagan, J. L. (1989). “The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval”, Journal of the American Society for Information Science 40(2): 115–132.
Fahy, P.J. (2003). “Indicators of support in online interaction”, The International Review of Research in Open and Distance Learning, 4(1). Retrieved June 13, 2006 from: http://www.irrodl.org/index.php/irrodl/article/view/129/209
Fahy, P.J., Crawford, G. & Ally, M. (2001). “Patterns of interaction in a computer conference transcript”, International Review of Research in Open and Distance Learning, 2 (1). Retrieved June 13, 2006 from: http://www.irrodl.org/index.php/irrodl/article/view/36/74
Garrison, D. R. & Anderson, T. (2003). E-Learning in the 21st Century. London: RoutledgeFalmer.
Hearne, B. & Nielsen, A. (2004). “Catch a cyber by the tale: Online orality and the lore of a distributed learning community”, in Haythornthwaite, C. & Kazmer, M. M. (Eds.) (pp. 59–87). Learning, Culture and Community in Online Education: Research and Practice. NY: Peter Lang.
Herring S. C. (1996). “Gender and democracy in computer-mediated communication”, in R. Kling (Ed.) Computerization and Controversy. 2nd edition. San Diego: Academic Press.
Herring, S.C. (1994). “Gender differences in computer-mediated communication: _Bringing familiar baggage to the new frontier.” Presented at American Library Association convention, Miami, FL. Retrieved June 13, 2006 from: http://www.cpsr.org/prevsite/cpsr/gender/herring.txt
Herring, S.C. (2000). “Gender Differences in CMC: Findings and Implications”, CPSR Newsletter, 18(1). Retrieved June 13, 2006 from: http://www.cpsr.org/issues/womenintech/herring
Herring, S.C. (2003). “Dynamic topic analysis of synchronous chat”, Symposium on New Research for New Media, University of Minnesota, Minneapolis. Retrieved June 5, 2006 from: http://ella.slis.indiana.edu/~herring/dta.html
Herring, S.C., Scheidt, L.A., Kouper, I. & Wright, E. (in press). “A longitudinal content analysis of weblogs: 2003–2004”, in M. Tremayne (Ed.), Blogging, Citizenship and the Future of Media. London: Routledge.
Hmelo-Silver, C. E. (2006). Analyzing collaborative learning: Multiple approaches to understanding processes and outcomes. ICLS’ 06: Proceedings of the 7th International Conference on Learning Sciences, Bloomington, Indiana. 1059–1065.
Krippendorff, K. (2004). Content Analysis. Thousand Oaks, CA: Sage.
Liddy, E.D. (1998). “Enhanced text retrieval using natural language processing”, Bulletin of the American Society for Information Science, 24(4). Available at: http://www.asis.org/Bulletin/Apr-98/liddy.html
McLaughlin, M. L., Osborne, K. K. & Smith, C. B. (1995). “Standards of conduct on usenet”, in S. G. Jones (Ed.), CyberSociety: Computer-Mediated Communication and Community (pp 90–111). Thousand Oaks, CA: Sage.
Mei, Q. & Zhai, C. (2005). “Discovering evolutionary themes patterns from text — an exploration of temporal text mining”, KDD’05 (Chicago, Illinois). 198–207.
Ooi, V. B. Y. (2000). “Aspects of computer-mediated communication for research in corpus linguistics”, Language and Computers, 36, 91–104.
Rafaeli, S. & Sudweeks, F. (1997). “Networked interactivity”, Journal of Computer-Mediated Communication, 2(4). Available online: http://www.ascusc.org/jcmc/vol2/issue4/rafaeli.sudweeks.html
Salton, G. (1988). “Syntactic approaches to automatic book indexing”, in Proceedings of the 26th Annual Meeting on Association for Computational Linguistics, Buffalo, New York. 204–210.
Schmid, H. (1994). “Probabilistic part-of-speech tagging using decision trees”, in Proceedings of International Conference on New Methods in Language Processing. Manchester, UK.
Sixl-Daniell, K. & Williams, J.B. (May 2005). Paralinguistic Discussion in an Online Educational Setting: A Preliminary Study. Retrieved June 13, 2006 from: http://www.u21global.edu.sg/portal/corporate/docs/wp_010-2005.pdf
Stuckey, B. & Barab, S. (forthcoming). “Why good design isn’t enough for websupported communities”, in R. Andrews & C. Haythornthwaite (Eds.), Handbook of Elearning Research, Sage.
Weber, R.P. (1985). Basic Content Analysis. Beverly Hills, CA: Sage.
Wu, H., Zubair, M., & Maly, K. (2006). “Harvesting social knowledge from folksonomies”, in Proceedings of the Seventeenth Conference on Hypertext and Hypermedia (Odense, Denmark, August 22–25, 2006). 111–114.
Zhai, C. (1997). “Fast statistical parsing of noun phrases for document indexing”, in Proceedings of the Fifth Conference on Applied Natural Language Proessing, Washington, DC. 312–319.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag London Limited
About this paper
Cite this paper
Haythornthwaite, C., Gruzd, A. (2007). A Noun Phrase Analysis Tool for Mining Online Community Conversations. In: Steinfield, C., Pentland, B.T., Ackerman, M., Contractor, N. (eds) Communities and Technologies 2007. Springer, London. https://doi.org/10.1007/978-1-84628-905-7_4
Download citation
DOI: https://doi.org/10.1007/978-1-84628-905-7_4
Publisher Name: Springer, London
Print ISBN: 978-1-84628-904-0
Online ISBN: 978-1-84628-905-7
eBook Packages: Computer ScienceComputer Science (R0)