Current approaches to punctuation in computational linguistics

Computers and the Humanities

Abstract

Some recent studies in computational linguistics have aimed to take advantage of various cues presented by punctuation marks. This short survey summarises these research efforts and outlines a current perspective on the usage and functions of punctuation marks. We conclude by presenting an information-based framework for punctuation, influenced by treatments of several related phenomena in computational linguistics.

Varol Akman is a professor of computer engineering at Bilkent University, Ankara, Turkey. From 1980 to 1985, he was a Fulbright scholar at Rensselaer Polytechnic Institute, Troy, New York, where he received a PhD degree in computer engineering. Prior to joining Bilkent in 1988, he held a senior researcher position with the Centrum voor Wiskunde en Informatica, Amsterdam, the Netherlands. His current research areas include artificial intelligence models of context, computational aspects of situation theory, and, in general, language and philosophy.

Abbreviations

DRT: discourse representation theory
DRS: discourse representation structure
NLP: natural language processing
NLG: natural language generation
RST: rhetorical structure theory
SDRT: segmented discourse representation theory
SDRS: segmented discourse representation structure

References

  • ACL/DCI. Association for Computational Linguistics Data Collection Initiative, CD-ROM 1, 1991. http://www.ldc.upenn.edu

  • Akram, Mohammed and A. M. Saadeddin. “Target-World Experiential Matching: The Case of Arabic/English Translating.” Quinquereme 10(2) (1987), 137–164.

  • Asher, Nicholas. Reference to Abstract Objects in Discourse. Dordrecht, Netherlands: Kluwer, 1993.

  • Bayraktar, Murat. Computer-Aided Analysis of English Punctuation on a Parsed Corpus: The Special Case of Comma. Master's thesis, Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1996.

  • Bolinger, Dwight. Intonation and Its Uses: Melody in Grammar and Discourse. Stanford, California: Stanford University Press, 1989.

  • Briscoe, Ted and John Carroll. “Developing and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels.” In Proceedings of International Workshop on Parsing Technologies. Prague, Czech Republic, 1995, pp. 48–58.

  • Briscoe, Ted. Parsing (with) Punctuation. Technical report, Rank Xerox Research Centre, Grenoble, France, 1994.

  • Briscoe, Ted. “The Syntax and Semantics of Punctuation and Its Use in Interpretation.” pp. 1–8. In (Jones, 1996a).

  • Chafe, Wallace. “Punctuation and the Prosody of Written Language.” Written Communication 5(4) (1988), 395–426.

  • Cruttenden, Alan. Intonation. Cambridge, UK: Cambridge University Press, 1986.

  • Dale, Robert. “Exploring the Role of Punctuation in the Signalling of Discourse Structure.” In Proceedings of a Workshop on Text Representation and Domain Modelling: Ideas from Linguistics and AI. Berlin, Germany: Technical University of Berlin, 1991a, pp. 110–120.

  • Dale, Robert. “The Role of Punctuation in Discourse Structure.” In Working Notes for the AAAI Fall Symposium on Discourse Structure in Natural Language Understanding and Generation. Asilomar, CA, 1991b, pp. 13–14.

  • Doran, Christine. “Punctuation in Quoted Speech.” pp. 9–18. In (Jones, 1996a).

  • Douglas, Shona and Matthew Hurst. “Layout and Language: Lists and Tables in Technical Documents.” pp. 19–24. In (Jones, 1996a).

  • Ehrlich, Eugene. Theory and Problems of Punctuation, Capitalization, and Spelling. Hong Kong: McGraw-Hill, 1992.

  • Engdahl, Elisabet and Enric Vallduví. “The Linguistic Realization of Information Packaging.” Linguistics 34 (1996), 459–519.

  • Fornell, Jan. “Punctuation in the Bravice English-to-Japanese Machine Translation System.” pp. 25–32. In (Jones, 1996a).

  • Francis, W. Nelson and Henry Kučera. Frequency Analysis of English Usage: Lexicon and Grammar. Boston, MA: Houghton Mifflin, 1982.

  • Garside, Roger, Geoffrey Leech and Geoffrey Sampson, Eds. The Computational Analysis of English. London: Longman, 1987.

  • Grosz, Barbara J. and Candace L. Sidner. “Attention, Intentions, and the Structure of Discourse.” Computational Linguistics 12(3) (1986), 175–204.

  • Hall, Nigel and Anne Robinson. The Punctuation Project. Manchester, UK: School of Education, Manchester Metropolitan University, 1996. http://bll.edu.aca.mmu.ac.uk/punctuation.html

  • Harris, Roy. Signs of Writing. London, UK: Routledge, 1995.

  • Henrichsen, Peter Juel. Does the Sentence Exist? Do We Need It? Unpublished Paper, Institute of Linguistics, University of Copenhagen, Copenhagen, Denmark, 1995.

  • Hoffman, Beryl. “Integrating ‘Free’ Word Order Syntax and Information Structure.” In Proceedings of the 1995 Conference of the European Chapter of Association for Computational Linguistics. Dublin, Ireland, 1995, pp. 245–252.

  • Humphreys, Lee. “Book Review: The Linguistics of Punctuation.” Machine Translation 7 (1993), 199–201.

  • Ince, Bahac Punctuation: The Special Case of Comma Categories. Senior Project Report, Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1996.

  • Jackendoff, Ray. X-bar Syntax: A Study of Phrase Structure. Cambridge, MA: MIT Press, 1977.

  • Jones, Bernard. Can Punctuation Help Parsing? Acquilex-II Working Paper 29, Computer Lab., Cambridge University, Cambridge, UK, 1994a.

  • Jones, Bernard. “Exploring the Role of Punctuation in Parsing Natural Language.” In Proceedings of the 15th International Conference on Computational Linguistics (COLING '94). Kyoto, Japan, 1994b, pp. 421–425.

  • Jones, Bernard. “Exploring the Variety and Use of Punctuation.” In Proceedings of the 17th Annual Cognitive Science Conference. Pittsburgh, PA, 1995, pp. 619–624.

  • Jones, Bernard, Ed. Punctuation in Computational Linguistics. Santa Cruz, CA: UCSC. SIGPARSE 1996 (Post Conference Workshop of ACL96). Available from Human Communication Research Center, University of Edinburgh, UK, 1996a. http://www.cogsci.ed.ac.uk/hcrc/publications/wp-2.html

  • Jones, Bernard. “Towards a Syntactic Account of Punctuation.” In Proceedings of the 16th International Conference on Computational Linguistics (COLING '96). Copenhagen, Denmark, 1996b, pp. 604–609.

  • Jones, Bernard. “Towards Testing the Syntax of Punctuation.” In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics-Student Session. Santa Cruz, CA, 1996c, pp. 363–365.

  • Jones, Bernard. What's the Point? A (Computational) Theory of Punctuation. PhD thesis, Centre for Cognitive Science, University of Edinburgh, Edinburgh, UK, 1997.

  • Kamp, Hans and Uwe Reyle. From Discourse to Logic: Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory. Dordrecht, Netherlands: Kluwer, 1993.

  • Karlsson, Fred, Atro Voutilainen, Juha Heikkilä, and Arto Anttila, Eds. Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text. Berlin, Germany: Mouton de Gruyter, 1994.

  • Lee, Sherman. A Syntax and Semantics for Text Grammar. Master's thesis, Engineering Dept., Cambridge University, Cambridge, UK, 1995.

  • Levinson, Joan Persily. Punctuation and the Orthographic Sentence: A Linguistic Analysis. PhD thesis, Dept. of Linguistics, City University of New York, NY, 1985.

  • Mann, William C. and Sandra A. Thompson. Rhetorical Structure Theory: A Theory of Text Organization. Technical Report RS-87–190, USC Information Sciences Institute, University of Southern California, Marina Del Rey, CA, 1987.

  • McDermott, John. Punctuation for Now. Hong Kong: MacMillan, 1990.

  • Meyer, Charles F. A Linguistic Study of American Punctuation. PhD thesis, University of Wisconsin-Milwaukee, WI, 1983.

  • Meyer, Charles F. “Punctuation Practice in the Brown Corpus.” ICAME Newsletter (1986), 80–95.

  • Meyer, Charles F. A Linguistic Study of American Punctuation. New York, NY: Peter Lang, 1987.

  • Min, Young-Lie. “Role of Punctuation in Disambiguation of Coordinate Compounds.” pp. 33–40. In (Jones, 1996a).

  • Muskens, Reinhard. “Combining Montague Semantics and Discourse Representation.” Linguistics and Philosophy 19 (1996), 143–186.

  • Nunberg, Geoffrey. The Linguistics of Punctuation. Number 18 in CSLI Lecture Notes. Stanford, CA: CSLI Publications, 1990.

  • Oehrle, Richard T. Lecture Notes: Prosody, Information and Grammatical Architecture. Seventh European Summer School in Logic, Language and Information, Barcelona, Spain, 1995.

  • Osborne, Miles. “Can Punctuation Help Learning?” In Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing, Lecture Notes in Artificial Intelligence, Number 1040. Eds. Stefan Wermter, Ellen Riloff, and Gabriele Scheler. Berlin: Springer-Verlag, 1996, pp. 399–412.

  • Parkes, M. B. Pause and Effect: An Introduction to the History of Punctuation in the West. Berkeley, CA: University of California Press, 1993.

  • Partridge, Eric. You Have a Point There: A Guide to Punctuation and its Allies. London, UK: Routledge, 1993.

  • Pascual, Elsa and Jacques Virbel. “Semantic and Layout Properties of Text Punctuation.” pp. 41–47. In (Jones, 1996a).

  • Pereira, Fernando and David Warren. “Definite Clause Grammars for Language Analysis - A Survey of the Formalism and a Comparison with Augmented Transition Networks.” Artificial Intelligence 13(3) (1980), 231–278.

  • Prevost, Scott and Mark Steedman. “Specifying Intonation from Context for Speech Synthesis.” Speech Communication 15 (1994), 139–153.

  • Sampson, Geoffrey. “Book Review: The Linguistics of Punctuation.” Linguistics 30(2) (1992), 467–475.

  • Sampson, Geoffrey. English for the Computer: The SUSANNE Corpus and Analytic Scheme. Oxford, UK: Oxford University Press, 1995.

  • Say, Bilge and Varol Akman. “Information-Based Aspects of Punctuation.” pp. 49–56. In (Jones, 1996a).

  • Say, Bilge and Varol Akman. “An Information-Based Treatment of Punctuation in Discourse Representation Theory.” In Second International Conference on Mathematical Linguistics. Tarragona, Spain, 1996b.

  • Say, Bilge and Varol Akman. Dashes as Cues to Discourse Structure. Manuscript. Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1997.

  • Say, Bilge. An Information-Based Approach to Punctuation. PhD Proposal, Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1995. http://www.cs.bilkent.edu.trrsay/bilge.html

  • Schiffrin, Deborah. Discourse Markers. Cambridge, UK: Cambridge University Press, 1987.

  • Scholes, Robert J. and Brenda J. Willis. “Prosodic and Syntactic Functions of Punctuation — A Contribution to the Study of Orality and Literacy.” Interchange 21(3) (1990), 13–20.

  • Shiuan, Peh Li and Christopher Ting Hian Ann. “A Divide-and-Conquer Strategy for Parsing.” pp. 57–66. In (Jones, 1996a).

  • Simard, Marthe. “Considerations on Parsing a Poorly Punctuated Text in French.” pp. 67–72. In (Jones, 1996a).

  • Smith, Carolena L. “Attitudinal Study of Graphic Computer-Based Instruction for Punctuation.” Journal of Technical Writing and Communication 3 (1986), 267–272.

  • Srinivasan, V. “Punctuation and Parsing of Real-World Texts.” In Proceedings of the Sixth Twente Workshop on Language Technologies. Eds. K. Sikkel and A. Nijholt. Enschede, Netherlands, 1991, pp. 163–167.

  • Steedman, Mark. “Structure and Intonation.” Language 67(2) (1991), 260–296.

  • Taylor, Lita J. and Gerry Knowles. Manual of Information to Accompany the SEC Corpus. Lancaster, UK: University of Lancaster, 1988.

  • Twine, Nanette. “The Adoption of Punctuation in Japanese Script.” Visible Language 18(3) (1984), 229–237.

  • Vallduví, Enric. The Informational Component. New York, NY: Garland, 1992.

  • White, Michael. “Presenting Punctuation.” In Proceedings of the Fifth European Workshop on Natural Language Generation. Leiden, Netherlands, 1995, pp. 107–125.

Author information

Correspondence to V. Akman.

Additional information

Bilge Say received her BS in Computer Engineering from Middle East Technical University, Ankara, Turkey, in 1990, and her MS in Computation from Oxford University, Oxford, UK, in 1991. She worked for two years as a systems support engineer in industry. Currently, she is a PhD student at Bilkent University, Ankara, Turkey, studying the information-based aspects of punctuation. She has recently conducted research at the Computer Lab of Cambridge University, Cambridge, UK, on an extended visit.

About this article

Cite this article

Say, B., Akman, V. Current approaches to punctuation in computational linguistics. Comput Hum 30, 457–469 (1996). https://doi.org/10.1007/BF00057941

  • DOI: https://doi.org/10.1007/BF00057941
