Distinguishing Properties and Relations in the Denotation of Adjectives: An Empirical Investigation

Hartung, Matthias; Frank, Anette

doi:10.1007/978-3-319-01541-5_8

Matthias Hartung⁸ &
Anette Frank⁸

Part of the book series: Studies in Linguistics and Philosophy ((SLAP,volume 94))

983 Accesses

Abstract

We empirically investigate the task of classifying adjectives into property-denoting vs. relational types, a distinction that is highly relevant for ontology learning. The feasibility of this task is evaluated in two experiments: (i) a corpus study based on human annotations and (ii) an automatic classification experiment. We observe that token-level annotation of these classes is expensive and difficult. Yet, a careful corpus analysis reveals that adjective classes tend to be stable on the type level, with few occurrences of class shifts observed at the token level. As a consequence, we opt for an automatic classification approach that operates on the type level. Training on heuristically labeled data yields high classification performance on our own data and on a data set compiled from WordNet. Our results indicate that it is feasible to automatically distinguish property-denoting and relational adjectives, even if only small amounts of annotated data are available. A combination of semantic, morphological and shallow syntactic features turns out to be most informative for the task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A strict comparison of the two approaches will not be possible due to the different languages considered and divergences regarding the selected target classes.
2.
We used version 3 of the BNC XML Edition, available from: http://www.natcorp.ox.ac.uk/
3.
Part of speech tagging was the primary source of errors here.
4.
In its original statement, the notion of foundation is defined as follows: “For a concept α to be founded on another concept β, any instance χ of α has to be necessarily associated to an instance ϕ of β which is not related to χ by a part-of relation” (Guarino, 1992). We extend this notion from concepts to properties, arguing that event-based adjectives denote founded properties that are necessarily associated with an implicit event.
5.
Note that these patterns are mutually exclusive: (5a) applies to examples such as comfortable chair and interesting article in (2b) and (2c), where ENT fills the patient role of EVENT. In contrast, eloquent person in (2a) can be identified as event-based by (5b) only, as ENT acts as the agent of EVENT here (cf. Lapata, 2001). We expect that disambiguating basic and event-related readings should work best if (5a) is constrained such that EVENT may not be instantiated by perception verbs such as look, feel, taste etc.
6.
κ measures the agreement among annotators on classification tasks. Its values range between 0 (no agreement at all) and 1 (perfect agreement), reflecting the degree of agreement above chance (Fleiss, 1971).
7.
Henceforth, we will refer to these binary classes as ATTR(ibutive) and REL(ational).
8.
In a selective investigation on more representative data, class volatility turns out to be only slightly higher (cf. Sect. 8.6.1).
9.
The abbreviations used in the table denote part-of-speech tags according to the Penn Treebank nomenclature (Marcus et al., 1993).
10.
Note that the pertainymy relation in WordNet is uni-directional as it contains only links from adjectives to their morphological base nouns, but not from derived nouns to base adjectives. For instance, cultural and culture or dental and tooth are linked by pertainymy, while no such link exists between short and shortness.

References

Almuhareb, Abdulrahman. 2006. Attributes in lexical acquisition. Ph.D. dissertation, Department of Computer Science, University of Essex.
Google Scholar
Almuhareb, Abdulrahman, and Massimo Poesio. 2004. Attribute-based and value-based clustering. An evaluation. In Proceedings of the 2004 conference on empirical methods in natural language processing, Barcelona, 158–165.
Google Scholar
Amoia, Marilisa, and Claire Gardent. 2008. A test suite for inference involving adjectives. In Proceedings of the 6th international conference on language resources and evaluation, Marrakech, 631–637.
Google Scholar
Baayen, R.H., R. Piepenbrock, and L. Gulikers. 1996. CELEX2. Philadelphia: Linguistic Data Consortium.
Google Scholar
Baroni, Marco, Silvia Bernardini, Adriano Ferraresi, and Eros Zanchetta. 2009. The wacky wide web: A collection of very large linguistically processed web-crawled corpora. Journal of Language Resources and Evaluation 43(3): 209–226.
Article Google Scholar
Barsalou, Lawrence W. 1992. Frames, concepts and conceptual fields. In Frames, Fields and Contrasts, ed. A. Lehrer and E.F. Kittay, 21–74. Hillsdale: Erlbaum.
Google Scholar
Batista, Gustavo, Ronaldo Prati, and Maria Carolina Monard. 2004. A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explorations 6: 20–29.
Article Google Scholar
Beesley, Kenneth R. 1982. Evaluative adjectives as one-place predicates in Montague grammar. Journal of Semantics 1(3): 195–249.
Article Google Scholar
Boleda, Gemma. 2006. Automatic acquisition of semantic classes for adjectives. Ph.D. dissertation, Pompeu Fabra University, Barcelona.
Google Scholar
Buitelaar, Paul, Philipp Cimiano, and Bernardo Magnini. 2005. Ontology learning from text. An overview. In Ontology learning from text. Methods, evaluation and applications, ed. Paul Buitelaar, Philipp Cimiano, and Bernardo Magnini, 3–12. IOS Press: Amsterdam.
Google Scholar
Cimiano, Philipp. 2006. Ontology learning and population from text. Algorithms, evaluation and applications. New York/London: Springer.
Google Scholar
Ciravegna, F. 2000. Challenges in information extraction from texts for knowledge management. IEEE Intelligent Systems 16(6): 84–86.
Google Scholar
Fellbaum, Christiane, ed. 1998. WordNet: An electronic lexical database. Cambridge: MIT.
Google Scholar
Fleiss, Joseph L. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin 76(5): 378–382.
Article Google Scholar
Guarino, Nicola. 1992. Concepts, attributes and arbitrary relations. Data & Knowledge Engineering 8: 249–261.
Article Google Scholar
Hartung, Matthias, and Anette Frank. 2010a. A semi-supervised type-based classification of adjectives. Distinguishing properties and relations. In Proceedings of the 7th international conference on language resources and evaluation, Valletta, 1029–1036.
Google Scholar
Hartung, Matthias, and Anette Frank. 2010b. A structured vector space model for hidden attribute meaning in adjective-noun phrases. In Proceedings of the 23rd international conference on computational linguistics (COLING), Beijing, 430–438.
Google Scholar
Hartung, Matthias, and Anette Frank. 2011. Exploring supervised LDA models for assigning attributes to adjective-noun phrases. In Proceedings of the 2011 conference on empirical methods in natural language processing, Edinburgh, 540–551.
Google Scholar
Hatzivassiloglou, Vasileios, and Kathleen R. McKeown. 1993. Towards the automatic identification of adjectival scales: Clustering adjectives according to meaning. In Proceedings of the 31st annual meeting of the association for computational linguistics, Columbus, 172–182.
Google Scholar
Kamp, Hans. 1975. Two theories about adjectives. In Formal semantics of natural language, ed. E.L. Keenan, 123–155. Cambridge: Cambridge University Press.
Chapter Google Scholar
Lapata, Mirella. 2001. The acquisition and modeling of lexical knowledge. A corpus-based investigation of systematic polysemy. Ph.D. dissertation, University of Edinburgh.
Google Scholar
Levin, Beth. 1993. English verb classes and alternations. A preliminary investigation. Chicago: University of Chicago Press.
Google Scholar
Levinson, Stephen C. 1983. Pragmatics. Cambridge: Cambridge University Press.
Google Scholar
Marcus, Mitchell P., Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English. The Penn Treebank. Computational Linguistics 19(2): 313–330.
Google Scholar
McNemar, Quinn. 1947. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12(2): 153–157.
Article Google Scholar
Miller, Katherine J. 1998. Modifiers in WordNet. In WordNet. An electronic lexical database, ed. Christiane Fellbaum, 47–67. Cambridge: MIT.
Google Scholar
Miyao, Yusuke, and Jun’ichi Tsujii. 2009. Supervised learning of a probabilistic lexicon of verb semantic classes. In Proceedings of the 2009 conference on empirical methods in natural language processing, Singapore, 1328–1337.
Google Scholar
Montague, Richard. 1974. English as a formal language. In Formal philosophy, ed. R.H. Thomason, 247–270. New Haven: Yale University Press.
Google Scholar
Pinto, H., and J. Martins. 2004. Ontologies. How can they be built? Knowledge and Information Systems 6(4): 441–464.
Google Scholar
Raskin, Victor, and Sergei Nirenburg. 1998. An applied ontological semantic microtheory of adjective meaning for natural language processing. Machine Translation 13: 135–227.
Article Google Scholar
Witten, Ian H., and Eibe Frank. 2005. Data mining. Practical machine learning tools and techniques, 2nd ed. San Francisco: Morgan Kaufmann.
Google Scholar

Download references

Acknowledgements

The authors thank the annotators for their efforts and three anonymous reviewers of the final and an earlier version of this paper for highly valuable comments. As this article is a revised and extended version of Hartung and Frank (2010a), we also gratefully acknowledge the permission of the European Language Resources Association (ELRA) to republish the material.

Author information

Authors and Affiliations

Computational Linguistics Department, Heidelberg University, Im Neuenheimer Feld 325, 69120, Heidelberg, Germany
Matthias Hartung & Anette Frank

Authors

Matthias Hartung
View author publications
You can also search for this author in PubMed Google Scholar
Anette Frank
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Matthias Hartung .

Editor information

Editors and Affiliations

Department of Linguistics and Information Science, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Thomas Gamerschlag
Department of Linguistics and Information Science, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Doris Gerland
Department of Linguistics and Information Science, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Rainer Osswald
Department of Linguistics and Information Science, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Wiebke Petersen

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Hartung, M., Frank, A. (2014). Distinguishing Properties and Relations in the Denotation of Adjectives: An Empirical Investigation. In: Gamerschlag, T., Gerland, D., Osswald, R., Petersen, W. (eds) Frames and Concept Types. Studies in Linguistics and Philosophy, vol 94. Springer, Cham. https://doi.org/10.1007/978-3-319-01541-5_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-01541-5_8
Published: 07 November 2013
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01540-8
Online ISBN: 978-3-319-01541-5
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)

Publish with us

Policies and ethics