Abstract
Multimodal corpora that show humans interacting via language are now relatively easy to collect. Current tools allow one either to apply sets of time-stamped codes to the data and consider their timing and sequencing or to describe some specific linguistic structure that is present in the data, built over the top of some form of transcription. To further our understanding of human communication, the research community needs code sets with both timings and structure, designed flexibly to address the research questions at hand. The NITE XML Toolkit offers library support that software developers can call upon when writing tools for such code sets and, thus, enables richer analyses than have previously been possible. It includes data handling, a query language containing both structural and temporal constructs, components that can be used to build graphical interfaces, sample programs that demonstrate how to use the libraries, a tool for running queries, and an experimental engine that builds interfaces on the basis of declarative specifications.
Article PDF
Similar content being viewed by others
References
AGTK: Annotation Graph Toolkit (n.d.). Retrieved May 26, 2003 from http://agtk.sourceforge.net/.
ATLAS Project (2000, Rev. February 6, 2003). Retrieved May 26, 2003 from http://www.nist.gov/speech/atlas/.
ATLAS.ti (n.d., Rev. February 18, 2003). Retrieved May 26, 2003 from http://www.atlasti.de/.
Bales, R. F. (1951).Interaction Process Analysis: A method for the study of small groups. Cambridge, MA: Addison-Wesley.
Barras, C., Geoffrois, E., Wu, Z., &Liberman, M. (2001). Transcriber: Development and use of a tool for assisting speech corpora production.Speech Communication,33, 5–22.
Carletta, J. C., McKelvie, D., Isard, A., Mengel, A., Klein, M., & Møller, M. B. (in press). A generic approach to software support for linguistic annotation using XML. In G. Sampson & D. McCarthy (Eds.),Corpus linguistics: Readings in a widening discipline. London: Continuum International.
Clark, J., & DeRose, S. (1999, Revised November 16, 1999).XML Path Language (XPath) Version 1.0. Retrieved May 26, 2003 from http://www.w3.org/TR/xpath.
Day, D., Aberdeen, J., Hirschman, L., Kozierok, R., Robinson, P., &Vilain, M. (1997). Mixed-initiative development of language processing systems. InFifth Conference on Applied Natural Language Processing. Washington, DC: Association for Computational Linguistics.
Evert, S., Carletta, J. C., O’Donnell, T. J., Kilgour, J, Vogele, A., & Voormann, H. (2002).NXT Data Model. Retrieved May 26, 2003 from http://www.ltg.ed.ac.uk/NITE/.
Evert, S., & Voormann, H. (2002).NXT Query Language. Retrieved May 26, 2003 from http://www.ltg.ed.ac.uk/NITE/.
Goodwin, C. (1981).Conversational organization: Interaction between speakers and hearers. New York: Academic Press.
Java foundationclasses: Cross-platform, GUIs and graphics (n.d., Revised April 12, 2003). Retrieved May 26, 2003 from http://java.sun.com/ products/jfc/.
Java Media Framework API (n.d., Revised May 6, 2003). Retrieved May 26, 2003, from http://java.sun.com/products/java-media/jmf/.
JDOM (n.d.). Retrieved May 26, 2003 from http://www.jdom.org/.
Kipp, M. (2001, September).Anvil: A generic annotation tool for multi-modal dialogue. Paper presented at the Seventh European Conference on Speech Communication and Technology (EUROSPEECH), Aalborg.
Language Technology Group (n.d., Revised March 30, 1998).LTG software: LT-POS. Retrieved May 26, 2003 from http://www.ltg. ed.ac.uk/software/pos/index.html.
McKelvie, D., Isard, A., Mengel, Ax., Møller, M. B., Grosse, M., &Klein, M. (2001). The MATE Workbench: An annotation tool for XML coded speech corpora.Speech Communication,33, 97–112.
Milde, J.-T., &Gut, U. (2001). The TASX-environment: An XML-based corpus database for time aligned language data. In S. Bird, P. Buneman, & M. Liberman (Eds.),Proceedings of the IRCS Workshop on Linguistic Databases (pp. 174–180). Philadelphia: University of Pennsylvania.
Noldus, L. P. J. J., Trienes, R.J.H., Hendriksen, A.H.M., Jansen, H., &Jansen, R. G. (2000). The Observer Video-Pro: New software for the collection, management, and presentation of time-structured data from videotapes and digital media files.Behavior Research Methods, Instruments, & Computers,32, 197–206.
Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J., &Hirschberg, J. (1992). TOBI: A standard for labeling English prosody. InInternational Conference on Speech and Language Processing (ICSLP) (Vol. 2, pp. 867–870) Alberta: Permanent Councilfor the Organization of International Conferences on Spoken Language Processing.
Sjölander, K., & Beskow, J. (n.d., Revised May 9, 2003).Wavesurfer. Retrieved May 26, 2003 from http://www.speech.kth.se/wavesurfer/.
World Wide Web Consortium (n.d.-a).Extensible Markup Language (XML). Retrieved May 26, 2003 from http://www.w3.org/XML/.
World Wide Web Consortium (n.d.-b, Revised January 16, 2003).The Extensible Stylesheet Language (XSL). Retrieved May 26, 2003 from http://www.w3.org/Style/XSL/.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Carletta, J., Evert, S., Heid, U. et al. The NITE XML Toolkit: Flexible annotation for multimodal language data. Behavior Research Methods, Instruments, & Computers 35, 353–363 (2003). https://doi.org/10.3758/BF03195511
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03195511