Abstract
Coordination is a linguistic phenomenon where two or more terms or phrases, called conjuncts, are conjoined by a coordinating conjunction, such as and, or, or but. Well-formed coordination structures seem to require that the conjuncts are semantically similar or related. In this paper, we utilize English corpus data to examine the semantic constraints on syntactically like coordinations, which link constituents with the same lexical or syntactic categories. We examine the extent to which these semantic constraints depend on the type of conjunction or on the lexical or syntactic category of the conjuncts. We employ two distinct, independent metrics to measure the semantic similarity of conjuncts: WordNet relations and semantic word embeddings. Our results indicate that both measures of similarity have varying distributions depending on the particular conjunction and the conjuncts’ lexical or syntactic categories.
Keywords
- Coordination
- Corpus linguistics
- Semantics
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
We focus on the coordination of lexical categories like nouns, verbs, and adjectives, as well as syntactic (phrasal) categories such as noun phrases, verb phrases and adjective phrases. For simplicity, we refer to both as categories in this paper.
- 2.
UD v2 also handles shared modifiers, such as the adjective old in “old men and women,” using a distinct type of annotation.
- 3.
Our code is available at https://github.com/jkallini/SemanticCoordinationAnalysis.
- 4.
To avoid potential false positives for synonymy, we filter out coordinations in which both conjuncts have the same lemma, as in “he ran faster and faster.”
- 5.
References
Behzad, S., Zeldes, A.: A cross-genre ensemble approach to robust Reddit part of speech tagging. In: Proceedings of the 12th Web as Corpus Workshop (WAC-XII), pp. 50–56 (2020)
Fellbaum, C.: Co-occurrence and antonymy. Int. J. Lexicogr. 8(4), 281–303 (1995)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Haspelmath, M.: Coordinating constructions: an overview. In: Coordinating Constructions. Typological Studies in Language, vol. 58, pp. 3–39. John Benjamins Publishing Company (2004). https://doi.org/10.1075/tsl.58.03has
Haspelmath, M.: Coordination. In: Shopen, T. (ed.) Language Typology and Syntactic Description, vol. 2, 2 edn., chap. 1. Cambridge University Press (2007). https://doi.org/10.1017/CBO9780511619434
Justeson, J.S., Katz, S.M.: Co-occurrences of antonymous adjectives and their contexts. Comput. Linguist. 17(1), 1–19 (1991)
Kallini, J., Fellbaum, C.: A corpus-based syntactic analysis of two-termed unlike coordination. In: Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 3998–4008. Association for Computational Linguistics, Punta Cana, Dominican Republic, November 2021. https://doi.org/10.18653/v1/2021.findings-emnlp.335, https://aclanthology.org/2021.findings-emnlp.335
Mikolov, T., Chen, K., Corrado, G.S., Dean, J.: Efficient estimation of word representations in vector space (2013). https://arxiv.org/abs/1301.3781
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Neural Information Processing Systems (NeurIPS) (2013). https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf
Miller, G.A.: Wordnet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Nivre, J., et al.: Universal dependencies v2: an evergrowing multilingual treebank collection. In: Proceedings of the 12th Language Resources and Evaluation Conference, pp. 4034–4043. European Language Resources Association, Marseille, France, May 2020. https://aclanthology.org/2020.lrec-1.497
Prażmowska, A.: Is unlike coordination against the law (of the coordination of likes)? (2015)
Quirk, R., Greenbaum, S., Leech, G., Svartvik, J.: A Comprehensive Grammar of the English Language. Longman, London (1985)
Sag, I.A., Gazdar, G., Wasow, T., Weisler, S.: Coordination and how to distinguish categories. Nat. Lang. Linguistic Theory 3(2), 117–171 (1985). https://www.jstor.org/stable/4047644
Silveira, N., et al.: A gold standard dependency corpus for English. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014), pp. 2897–2904. European Language Resources Association (ELRA), Reykjavik, Iceland, May 2014. https://www.lrec-conf.org/proceedings/lrec2014/pdf/1089_Paper.pdf
Stenström, E.: CoNLL-U parser. https://github.com/EmilStenstrom/conllu/ (2021)
Williams, E.S.: Transformationless grammar. In: Linguistic Inquiry, vol. 12, pp. 645–653 (1981)
Zeldes, A.: The GUM corpus: creating multilayer resources in the classroom. Lang. Resour. Eval. 51(3), 581–612 (2016). https://doi.org/10.1007/s10579-016-9343-x
Zeman, D., et al.: CoNLL 2017 shared task: multilingual parsing from raw text to universal dependencies. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 1–19. Association for Computational Linguistics, Vancouver, Canada, August 2017. https://doi.org/10.18653/v1/K17-3001, https://aclanthology.org/K17-3001
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Kallini, J., Fellbaum, C. (2022). Computational Approaches for Understanding Semantic Constraints on Two-termed Coordination Structures. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2022. Lecture Notes in Computer Science(), vol 13502. Springer, Cham. https://doi.org/10.1007/978-3-031-16270-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-031-16270-1_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16269-5
Online ISBN: 978-3-031-16270-1
eBook Packages: Computer ScienceComputer Science (R0)