Abstract
We consider linguistic data(base) summaries, exemplified for a personnel database by “most employees are young and well paid” (with some degree of truth), and their extensions, as a very general tool for the human-consistent summarization of large data sets. We advocate the use of Zadeh’s concept of protoforms (prototypical forms) of linguistic summaries, in the more advanced form proposed by Kacprzyk and Zadrożny. We then present an extension of our interactive approach to the generation of linguistic summaries, which uses our fuzzy querying interface supporting queries with fuzzy linguistic quantifiers. We concentrate on a specific type of linguistic summary that parallels specific fuzzy association rules, and show the use of an efficient algorithm for mining such rules. We extend the approach to the dynamic case by using linguistic summaries of time series data. As an example, we show the use of linguistic summaries for Web server log analysis in both the static and the dynamic case. The results can be useful for the design and maintenance of computer networks.
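To make the notion of a "degree of truth" concrete, the following is a minimal sketch in Python based on Zadeh's calculus of linguistically quantified propositions, which the abstract's fuzzy linguistic quantifiers suggest. It is not taken from the chapter. The record layout, the membership functions for "young" and "well paid", and all function names are assumptions made only for illustration. The piecewise-linear shape used for "most" is a commonly used choice. The second function evaluates the qualified form "most young employees are well paid", whose ratio resembles the confidence of an association rule.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical record layout: (age in years, monthly salary).
Employee = Tuple[float, float]
Membership = Callable[[Employee], float]


def ramp_up(x: float, zero: float, full: float) -> float:
    """Piecewise-linear membership: 0 up to `zero`, 1 from `full` onwards."""
    if x <= zero:
        return 0.0
    if x >= full:
        return 1.0
    return (x - zero) / (full - zero)


# Assumed fuzzy predicates; the breakpoints are illustrative only.
def mu_young(e: Employee) -> float:
    return 1.0 - ramp_up(e[0], 30.0, 40.0)


def mu_well_paid(e: Employee) -> float:
    return ramp_up(e[1], 3000.0, 5000.0)


def mu_most(p: float) -> float:
    """Assumed relative quantifier 'most': 0 below 0.3, 1 above 0.8."""
    return ramp_up(p, 0.3, 0.8)


def truth_simple(data: Sequence[Employee], summarizer: Membership,
                 quantifier: Callable[[float], float]) -> float:
    """Truth of 'Q y's are S': quantifier applied to the mean membership in S."""
    if not data:
        return 0.0
    return quantifier(sum(summarizer(e) for e in data) / len(data))


def truth_qualified(data: Sequence[Employee], qualifier: Membership,
                    summarizer: Membership,
                    quantifier: Callable[[float], float]) -> float:
    """Truth of 'Q R y's are S': quantifier applied to sum(min(R, S)) / sum(R)."""
    denominator = sum(qualifier(e) for e in data)
    if denominator == 0.0:
        return 0.0
    numerator = sum(min(qualifier(e), summarizer(e)) for e in data)
    return quantifier(numerator / denominator)


def young_and_well_paid(e: Employee) -> float:
    """Conjunction of the two predicates, modelled by the minimum."""
    return min(mu_young(e), mu_well_paid(e))


# Made-up sample data, not from the chapter.
employees = [(25, 6000), (28, 5500), (35, 4000), (50, 7000)]

t_simple = truth_simple(employees, young_and_well_paid, mu_most)
t_qualified = truth_qualified(employees, mu_young, mu_well_paid, mu_most)
print(f"most employees are young and well paid: {t_simple:.2f}")
print(f"most young employees are well paid:     {t_qualified:.2f}")
```

Using the minimum for "and" and a piecewise-linear quantifier keeps the sketch short. Other t-norms or quantifier shapes would change the resulting truth values but not the overall scheme.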
To the memory of Professor Da Ruan, a younger colleague and a close friend, who always maintained a proper balance between formal and analytic elegance and practical usefulness, and had a feel for what really matters.
References
Zadeh LA, Kacprzyk J (eds.) (1999) Computing with words in information/intelligent systems. 1 Foundations, 2 Applications. Springer, Heidelberg
Yager RR (1982) A new approach to the summarization of data. Inf Sci 28:69–86
Kacprzyk J, Yager RR (2001) Linguistic summaries of data using fuzzy logic. Int J Gen Syst 30:133–154
Kacprzyk J, Yager RR, Zadrozny S (2000) A fuzzy logic based approach to linguistic summaries of databases. Int J Appl Math Comput Sci 10:813–834
Zadrozny S, Kacprzyk J (1999) On database summarization using a fuzzy querying interface. In: Proceedings of IFSA 99 world congress (Taipei, Taiwan R.O.C.). 1:39–43
Kacprzyk J, Zadrozny S (1998) Data mining via linguistic summaries of data: an interactive approach. In: Yamakawa T, Matsumoto G (eds.) Methodologies for the conception, design and application of soft computing—Proceedings of IIZUKA 98. Iizuka, Japan 668–671
Kacprzyk J, Zadrozny S (2001) On linguistic approaches in flexible querying and mining of association rules. In: Larsen HL, Kacprzyk J, Zadrozny S, Andreasen T, Christiansen H (eds.) Flexible query answering systems. Recent advances, Springer, Heidelberg 475–484
Kacprzyk J, Zadrozny S (2001) Data mining via linguistic summaries of databases: an interactive approach. In: Ding L (ed.) A new paradigm of knowledge engineering by soft computing. World Scientific, Singapore 325–345
Kacprzyk J, Zadrozny S (2001) Fuzzy linguistic summaries via association rules. In: Kandel A, Last M, Bunke H (eds.) Data mining and computational intelligence. Springer, Heidelberg 115–139
Kacprzyk J, Zadrozny S (2001) Protoforms of linguistic data summaries: towards more general natural-language-based data mining tools. In: Abraham A, Ruiz del Solar J, Koeppen M (eds.) Soft Computing System. IOS Press, Amsterdam 417–425
Kacprzyk J, Zadrozny S (2005) Linguistic database summaries and their protoforms: towards natural language based knowledge discovery tools. Inf Sci 173(4):281–304
Kacprzyk J, Zadrozny S (1995) FQUERY for Access: fuzzy querying for a windows-based DBMS. In: Bosc P, Kacprzyk J (eds.) Fuzziness in database management systems. Springer, Heidelberg 415–433
Kacprzyk J, Zadrozny S (2001) Computing with words in intelligent database querying: standalone and Internet-based applications. Inf Sci 134:71–109
Zadeh LA (2002) A prototype-centered approach to adding deduction capabilities to search engines—the concept of a protoform. In: BISC Seminar 2002. University of California, Berkeley
Zadeh LA (1983) A computational approach to fuzzy quantifiers in natural languages. Comput Math Appl 9:149–184
Zadeh LA (1985) Syllogistic reasoning in fuzzy logic and its application to usuality and reasoning with dispositions. IEEE Trans Syst Man Cybern SMC-15, 754–763
Yager RR (1988) On ordered weighted averaging operators in multicriteria decision making. IEEE Trans Syst Man Cybern SMC-18, 183–190
Yager RR, Kacprzyk J (eds) (1997) The ordered weighted averaging operators: theory and applications. Kluwer, Boston
Liu Y, Kerre EE (1998) An overview of fuzzy quantifiers interpretations. Fuzzy Sets Syst 95:1–21
Kacprzyk J, Wilbik A, Zadrozny S (2008) Linguistic summarization of time series using a fuzzy quantifier driven aggregation. Fuzzy Sets Syst 159:1485–1499
Kacprzyk J, Wilbik A, Zadrozny S (2010) An approach to the linguistic summarization of time series using a fuzzy quantifier driven aggregation. Int J Intell Syst 25:411–439
Kacprzyk J, Ziółkowski A (1986) Database queries with fuzzy linguistic quantifiers. IEEE Trans Syst Man Cybern SMC-16, 474–479
Kacprzyk J, Zadrozny S, Ziółkowski A (1989) FQUERY III+: a human consistent database querying system based on fuzzy logic with linguistic quantifiers. Inf Syst 14(6):443–453
Kacprzyk J, Zadrozny S (2010) Computing with words is an implementable paradigm: fuzzy queries, linguistic data summaries, and natural-language generation. IEEE Trans Fuzzy Syst 18:461–472
Zadrozny S, Kacprzyk J (1995) Fuzzy querying using the query-by-example option in a windows based DBMS. In: Proceedings of third European congress on intelligent techniques and soft computing—EUFIT 95. Aachen, 2:733–736
George R, Srikanth R (1996) Data summarization using genetic algorithms and fuzzy logic. In: Herrera F, Verdegay JL (eds.) Genetic algorithms and soft computing. Springer, Heidelberg, 599–611
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of the 20th international conference on very large databases. Santiago, pp 487–499
Borgelt Ch, Kruse R (2001) Induction of association rules: apriori implementation. In: 15th conference on computational statistics. Springer, Berlin, pp 395–400
Kacprzyk J, Wilbik A (2010) Comparison of time series via classic and temporal protoforms of linguistic summaries: an application to mutual funds and their benchmarks. In: Borgelt Ch et al (eds) Combining soft computing and statistical methods in data analysis. Springer, Berlin, pp 369–377
Acknowledgments
This work was partially supported by the National Science Centre (contract no. UMO-2011/01/B/ST6/06908).
© 2012 Atlantis Press
About this chapter
Cite this chapter
Kacprzyk, J., Zadrożny, S. (2012). Power of Linguistic Data Summaries and Their Protoforms. In: Kahraman, C. (eds) Computational Intelligence Systems in Industrial Engineering. Atlantis Computational Intelligence Systems, vol 6. Atlantis Press, Paris. https://doi.org/10.2991/978-94-91216-77-0_4
DOI: https://doi.org/10.2991/978-94-91216-77-0_4
Publisher Name: Atlantis Press, Paris
Print ISBN: 978-94-91216-76-3
Online ISBN: 978-94-91216-77-0
eBook Packages: Computer Science (R0)