Induction of Modular Classification Rules by Information Entropy Based Rule Generation

Liu, Han; Gegov, Alexander

doi:10.1007/978-3-319-27267-2_7

Han Liu⁶ &
Alexander Gegov⁶

Part of the book series: Studies in Computational Intelligence ((SCI,volume 623))

544 Accesses
3 Citations

Abstract

Prism has been developed as a modular classification rule generator following the separate and conquer approach since 1987 due to the replicated sub-tree problem occurring in Top-Down Induction of Decision Trees (TDIDT). A series of experiments have been done to compare the performance between Prism and TDIDT which proved that Prism may generally provide a similar level of accuracy as TDIDT but with fewer rules and fewer terms per rule. In addition, Prism is generally more tolerant to noise with consistently better accuracy than TDIDT. However, the authors have identified through some experiments that Prism may also give rule sets which tend to underfit training sets in some cases. This paper introduces a new modular classification rule generator, which follows the separate and conquer approach, in order to avoid the problems which arise with Prism. In this paper, the authors review the Prism method and its advantages compared with TDIDT as well as its disadvantages that are overcome by a new method using Information Entropy Based Rule Generation (IEBRG). The authors also set up an experimental study on the performance of the new method in classification accuracy and computational efficiency. The method is also evaluated comparatively with Prism.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufman (1993)
Google Scholar
Hunt, E.B., Stone, P.J., Marin, J.: Experiments in Induction. Academic Press, New York (1966)
Google Scholar
Michalski, R.S.: On the quasi-minimal solution of the general covering problem. In: Proceedings of the Fifth International Symposium on Information Processing, Bled, Yugoslavia, pp. 125–128 (1969)
Google Scholar
Bramer, M.A.: Principles of Data Mining. Springer, London (2007)
MATH Google Scholar
Cendrowska, J.: PRISM: an algorithm for inducing modular rules. Int. J. Man Mach. Stud. 27, 349–370 (1987)
Article MATH Google Scholar
Shannon, C.: A mathematical theory of communication. Bell Syst. Tech. J. 27(3), 379–423 (1948)
Google Scholar
Stahl, F., Bramer, M.: Induction of modular classification rules: using Jmax-pruning. In: 30th SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge (2011)
Google Scholar
Stahl, F., Bramer, M.: Jmax-pruning: a facility for the information theoretic pruning of modular classification rules. Knowl.-Based Syst. 29(2012), 12–19 (2012)
Article Google Scholar
Deng, X.: A Covering-Based Algorithm for Classification: PRISM. CS831: Knowledge Discover in Databases (2012)
Google Scholar
Kerber, R.: Chimerge: discretization of numeric attributes. In: AAAI’92 Proceedings of the tenth national conference on Artificial intelligence, pp. 123–128 (1992)
Google Scholar
Bramer, M.A.: Inducer: a public domain workbench for data mining. Int. J. Syst. Sci. 36, 909–919 (2005)
Article MATH Google Scholar
Stahl, F., Bramer, M.: Computationally efficient induction of classification rules with the PMCRI and J-PMCRI frameworks. Knowl. Based Syst. 35(2012), 49–63 (2012)
Article Google Scholar
Bramer, M.A.: Automatic induction of classification rules from examples using N-Prism. In: Research and Development in Intelligent Systems XVI, pp. 99–121. Springer, Berlin (2000)
Google Scholar
Blake, C.L., Merz, C.J.: UCI repository of machine learning databases. Technical Report, University of California, Irvine, Department of Information and Computer Sciences (1998)
Google Scholar
Bramer, M.A.: Using J-pruning to reduce overfitting of classification rules in noisy domains. In: Proceedings of 13th International Conference on Database and Expert Systems Applications— DEXA 2002, Aix-en-Provence, France, 2–6 Sept 2002
Google Scholar
Bramer, M.A.: Using J-pruning to reduce overfitting in classification trees. In: Research and Development in Intelligent Systems XVIII, pp. 25–38. Springer, Berlin (2002)
Google Scholar
Smyth, P., Goodman, R.M.: Rule induction using information theory. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 159–176. AAAI Press (1991)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, University of Portsmouth, Buckingham Building, Lion Terrace, Portsmouth, PO1 3HE, UK
Han Liu & Alexander Gegov

Authors

Han Liu
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Gegov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Han Liu .

Editor information

Editors and Affiliations

Inst of Info & Commu Tech, Bulgarian Academy of Sciences, Sofia, Bulgaria
Vassil Sgurev
Iona College, Machine Intelligence Institute, New Rochelle, New York, USA
Ronald R. Yager
Sys Res Intit of Polish Acad of Science, Intelligent Systems Laboratory, Warsaw, Poland
Janusz Kacprzyk
and Information Technologies, University of Library Studies, Sofia, Bulgaria
Vladimir Jotsov

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Liu, H., Gegov, A. (2016). Induction of Modular Classification Rules by Information Entropy Based Rule Generation. In: Sgurev, V., Yager, R., Kacprzyk, J., Jotsov, V. (eds) Innovative Issues in Intelligent Systems. Studies in Computational Intelligence, vol 623. Springer, Cham. https://doi.org/10.1007/978-3-319-27267-2_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-27267-2_7
Published: 03 February 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27266-5
Online ISBN: 978-3-319-27267-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics