Abstract
Some recent studies in computational linguistics have aimed to take advantage of various cues presented by punctuation marks. This short survey is intended to summarise these research efforts and additionally, to outline a current perspective for the usage and functions of punctuation marks. We conclude by presenting an information-based framework for punctuation, influenced by treatments of several related phenomena in computational linguistics.
Varol Akman is a professor of computer engineering at Bilkent University, Ankara, Turkey. From 1980 to 1985, he was a Fulbright scholar at Rensselaer Polytechnic Institute, Troy, New York, where he received a PhD degree in computer engineering. Prior to joining Bilkent in 1988, he held a senior researcher position with the Centrum voor Wiskunde en Informatica, Amsterdam, the Netherlands. His current research areas include artificial intelligence models of context, computational aspects of situation theory, and in general, language and philosophy.
Similar content being viewed by others
Abbreviations
- DRT:
-
discourse representation theory
- DRS:
-
discourse representation structure
- NLP:
-
natural language processing
- NLG:
-
natural language generation
- RST:
-
rhetorical structure theory
- SDRT:
-
segmented discourse representation theory
- SDRS:
-
segmented discourse representation structure
References
ACL/DCI. Association for Computational Linguistics Data Collection Initiative, CD-ROM 1, 1991. http://www.ldc.upenn.edu
Akram, Mohammed and A. M. Saadeddin. “Target-World Experiential Matching: The Case of Arabic/English Translating.” Quinquereme 10(2) (1987), 137–164.
Asher, Nicholas. Reference to Abstract Objects in Discourse. Dordrecht, Netherlands: Kluwer, 1993.
Bayraktar, Murat. Computer-Aided Analysis ofEnglish Punctuation on a Parsed Corpus: The Special Case of Comma. Master's thesis, Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1996.
Bolinger, Dwight. Intonation and Its Uses: Melody in Grammar and Discourse. Stanford, California: Stanford University Press 1989.
Briscoe, Ted and John Carroll. “Developing and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels.” In Proceedings of International Workshop on Parsing Technologies. Prague, Czech Republic, 1995, pp. 48–58.
Briscoe, Ted. Parsing (with) Punctuation. Technical report, Rank Xerox Research Centre, Grenoble, France, 1994.
Briscoe, Ted. “The Syntax and Semantics of Punctuation and Its Use in Interpretation.” pp. 1–8. In (Jones, 1996a).
Chafe, Wallace. “Punctuation and the Prosody of Written Language.” Written Communication 5(4) (1988), 395–426.
Cruttenden, Allen. Intonation.Cambridge, UK: Cambridge University Press, 1986.
Dale, Robert. “Exploring the Role of Punctuation in the Signalling of Discourse Structure.” In Proceedings of a Workshop on Text Representation and Domain Modelling: Ideas from Linguistics and AI. Berlin, Germany: Technical University of Berlin, 1991a, pp. 110–120.
Dale, Robert. “The Role of Punctuation in Discourse Structure.” In Working Notes for the AAAI Fall Symposium on Discourse Structure in Natural Language Understanding and Generation. Asilomar, CA, 1991b, pp. 13–14.
Doran, Christine. “Punctuation in Quoted Speech.” pp. 9–18. In (Jones, 1996a).
Douglas, Shona and Matthew Hurst. “Layout and Language: Lists and Tables in Technical Documents.” pp. 19–24. In (Jones, 1996a).
Ehrlich, Eugene. Theory and Problems of Punctuation, Capitalization, and Spelling. Hong Kong: McGraw-Hill, 1992.
Engdahl, Elisabet and Enric Vallduví. “The Linguistic Realization of Information Packaging.” Linguistics 34 (1996), 459–519.
Fornell, Jan. “Punctuation in the Bravice English-to-Japanese Machine Translation System.” pp. 25–32. In (Jones, 1996a).
Francis, W. Nelson and Henry Kuěra. Frequency Analysis of English Usage: Lexicon and Grammar. Boston, MA: Houghton Mifflin, 1982.
Garside, Roger, Geoffrey Leech and Geoffrey Sampson, Eds. The Computational Analysis of English. London: Longman, 1987.
Grosz, Barbara J. and Candace L. Sidner. “Attention, Intentions, and the Structure of Discourse.” Computational Linguistics 12(3) (1986), 175–204.
Hall, Nigel and Anne Robinson. The Punctuation Project. Manchester, UK: School of Education, Manchester Metropolitan University, 1996. http://bll.edu.aca.mmu.ac.uk/punctuation.html
Harris, Roy. Signs of Writing. London, UK: Routledge, 1995.
Henrichsen, Peter Juel. Does the Sentence Exist? Do We Need It? Unpublished Paper, Institute of Linguistics, University of Copenhagen, Copenhagen, Denmark, 1995.
Hoffman, Beryl. “Integrating ‘Free’ Word Order Syntax and Information Structure.” In Proceedings of the 1995 Conference of the European Chapter of Association for Computational Linguistics. Dublin, Ireland, 1995, pp. 245–252.
Humphreys, Lee. “Book Review: The Linguistics of Punctuation.” Machine Translation 7(1993), 199–201.
Ince, Bahac Punctuation: The Special Case of Comma Categories. Senior Project Report, Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1996.
Jackendoff, Ray. X-bar Syntax: A Study of Phrase Structure. Cambridge, MA: MIT Press, 1977.
Jones, Bernard. Can Punctuation Help Parsing? Acquilex-II Working Paper 29, Computer Lab., Cambridge University, Cambridge, UK, 1994a.
Jones, Bernard. “Exploring the Role of Punctuation in Parsing Natural Language.” In Proceedings of the 15th International Conference on Computational Linguistics (COLING '94). Kyoto, Japan, 1994b, pp. 421–425.
Jones, Bernard. “Exploring the Variety and Use of Punctuation.” In Proceedings of the 17th Annual Cognitive Science Conference. Pittsburgh, PA, 1995, pp. 619–624.
Jones, Bernard, Ed. Punctuation in Computational Linguistics. Santa Cruz, CA: UCSC. SIGPARSE 1996 (Post Conference Workshop of ACL96). Available from Human Communication Research Center, University of Edinburgh, UK, 1996a. http://www.cogsci.ed.ac.uk/hcrc/publications/wp-2.html
Jones, Bernard. “Towards a Syntactic Account of Punctuation.” In Proceedings of the 16th International Conference on Computational Linguistics (COLING '96). Copenhagen, Denmark, 1996b, pp. 604–609.
Jones, Bernard. “Towards Testing the Syntax of Punctuation.” In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics-Student Session. Santa Cruz, CA, 1996c, pp. 363–365.
Jones, Bernard. What's the Point? A (Computational) Theory of Punctuation. PhD thesis, Centre for Cognitive Science, University of Edinburgh, Edinburgh, UK, 1997.
Kamp, Hans and Uwe Reyle. From Discourse to Logic: Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory. Dordrecht, Netherlands: Kluwer, 1993.
Karlsson, Fred, Atro Voutilainen, Juha Heikkila, and Arto Antilla, Eds. Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text. Berlin, German: Mouton de Gruyter, 1994.
Lee, Sherman. A Syntax and Semantics for Text Grammar. Master's thesis, Engineering Dept., Cambridge University, Cambridge, UK, 1995.
Levinson, Joan Persily. Punctuation and the Orthographic Sentence: A Linguistic Analysis. PhD thesis, Dept. of Linguistics, City University of New York, NY, 1985.
Mann, William C. and Sandra A. Thompson. Rhetorical Structure Theory: A Theory of Text Organization. Technical Report RS-87–190, USC Information Sciences Institute, University of Southern California, Marina Del Rey, CA, 1987.
McDermott, John. Punctuation for Now. Hong Kong: MacMillan, 1990.
Meyer, Charles F. A Linguistic Study of American Punctuation. PhD thesis, University of Wisconsin-Milwaukee, WI, 1983.
Meyer, Charles R. “Punctuation Practice in the Brown Corpus.” ICAME Newsletter (1986), 80–95.
Meyer, Charles F. A Linguistic Study of American Punctuation. New York, NY: Peter Lang, 1987.
Min, Young-Lie. “Role of Punctuation in Disambiguation of Coordinate Compounds.” pp. 33–40. In (Jones, 1996a).
Muskens, Reinhard. “Combining Montague Semantics and Discourse Representation.” Linguistics and Philosophy 19 (1996), 143–186.
Nunberg, Geoffrey. The Linguistics of Punctuation. Number 18 in CSLI Lecture Notes. Stanford, CA: CSLI Publications, 1990.
Oehrle, Richard T. Lecture Notes: Prosody, Information and Grammatical Architecture. Seventhth European Summer School in Logic, Language and Information, Barcelona, Spain, 1995.
Osborne, Miles. “Can Punctuation Help Learning?” In Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing, Lecture Notes in Artificial Intelligence, Number 1040. Eds. Stefan Wermter, Ellen Riloff, and Gabriele Scheler. Berlin: Springer-Verlag, Berlin, 1996, pp. 399–412.
Parkes, M. B. Pause and Effect: An Introduction to the History of Punctuation in the West. Berkeley, CA: University of California Press, 1993.
Partridge, Eric. You Have a Point There: A Guide to Punctuation and its Allies. London, UK: Routledge, 1993.
Pascual, Elsa and Jacques Virbel. “Semantic and Layout Properties of Text Punctuation.” pp. 41–47. In (Jones, 1996a).
Pereira, Fernando and David Warren. “Definite Clause Grammars for Language Analysis - A Survey of the Formalism and a Comparison with Augmented Transition Networks.” Artificial Intelligence 13(3) (1980), 231–278.
Prevost, Scott and Mark Steedman. “Specifying Intonation from Context for Speech Synthesis.” Speech Communications 15 (1994), 139–153.
Sampson, Geoffrey. “Book Review: The Linguistics of Punctuation.” Linguistics 30(2) (1992), 467–475.
Sampson, Geoffrey. English for the Computer: The SUSANNE Corpus and Analytic Scheme. Oxford, UK: Oxford University Press, 1995.
Say, Bilge and Varol Akman. “Information-Based Aspects of Punctuation.” pp. 49–56. In (Jones, 1996a).
Say, Bilge and Varol Akman. “An Information-Based Treatment of Punctuation in Discourse Representation Theory.” In Second International Conference on Mathematical Linguistics. Tarragona, Spain, 1996b.
Say, Bilge and Varol Akman. Dashes as Cues to Discourse Structure. Manuscript. Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1997.
Say, Bilge. An Information-Based Approach to Punctuation. PhD Proposal, Dept. of Computer Engineering and Information Science, Bilkent University, Ankara, Turkey, 1995. http://www.cs.bilkent.edu.trrsay/bilge.html
Schiffrin, Deborah. Discourse Markers. Cambridge, UK: Cambridge University Press, 1987.
Scholes, Robert J. and Brenda J. Willis. “Prosodic and Syntactic Functions of Punctuation — A Contribution to the Study of Orality and Literacy.” Interchange 21(3) (1990), 13–20.
Shiuan, Peh Li and Christopher Ting Hian Ann. “A Divide-and-Conquer Strategy for Parsing.” pp. 57–66. In (Jones, 1996a). Simard, Marthe. “Considerations on Parsing a Poorly Punctuated Text in French.” pp. 67–72. In (Jones, 1996a).
Smith, Carolena L. “Attitudinal Study of Graphic Computer-Based Instruction for Punctuation.” Journal of Technical Writing and Communication 3 (1986), 267–272.
Srinivasan, V. “Punctuation and Parsing of Real-World Texts.” In Proceedings of the Sixth Twente Workshop on Language Technologies. Eds. K. Sikkel and A. Nijholt. Enschede, Netherlands, 1991, pp. 163–167.
Steedman, Mark. “Structure and Intonation.” Language 67(2) (1991), 260–296.
Taylor, Lita J. and Gerry Knowles. Manual ofInformation to Accompany the SEC Corpus. Lancaster, UK: University of Lancaster, 1988.
Twine, Nanette. “The Adoption of Punctuation in Japanese Script.” Visible Language 18(3) (1984), 229–237.
Vallduvf, Enric. The Informational Component. Garland, New York, 1992.
White, Micheal. “Presenting Punctuation.” In Proceedings of the Fifth European Workshop on Natural Language Generation. Leiden, Netherlands, 1995, pp. 107–125.
Author information
Authors and Affiliations
Corresponding author
Additional information
Bilge Say received her BS in Computer Engineering from Middle East Technical University, Ankara, Turkey, in 1990, and her MS in Computation from Oxford University, Oxford, UK, in 1991. She worked two years as a systems support engineer in the industry. Currently, she is a PhD student at Bilkent University, Ankara, Turkey, studying the information-based aspects of punctuation. She has recently conducted research at the Computer Lab of Cambridge University, Cambridge, UK, on an extended visit.
Rights and permissions
About this article
Cite this article
Say, B., Akman, V. Current approaches to punctuation in computational linguistics. Comput Hum 30, 457–469 (1996). https://doi.org/10.1007/BF00057941
Issue Date:
DOI: https://doi.org/10.1007/BF00057941