Abstract
The tuples in a generalized relation (i.e., a summary generated from a database) are unique, and therefore, can be considered to be a population with a structure that can be described by some frequency or probability distribution based upon the values contained in the derived Count attribute. In this chapter, we describe sixteen diversity measures that evaluate the frequency or probability distribution of the values in the derived Count attribute in a summary to assign a single real-valued index that represents its interestingness relative to other summaries generated from the same database. The measures are well-known measures of dispersion, dominance, inequality, and concentration that have previously been successfully and frequently applied in several areas of the physical, social, ecological, management, information, and computer sciences. Their use for ranking summaries generated from databases is a new application area.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media New York
About this chapter
Cite this chapter
Hilderman, R.J., Hamilton, H.J. (2001). Heuristic Measures of Interestingness. In: Knowledge Discovery and Measures of Interest. The Springer International Series in Engineering and Computer Science, vol 638. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3283-2_4
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3283-2_4
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4913-4
Online ISBN: 978-1-4757-3283-2
eBook Packages: Springer Book Archive