Skip to main content
Log in

A novel algorithm for mining couples of enhanced association rules based on the number of output couples and its application

  • Research
  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

Besides the need for more advanced predictive methods, there is increasing demand for easily interpretable results. Couples of enhanced association rules (a generalization of association rules/apriori/frequent itemsets) are excellent candidates for this task. They can be interpreted in various ways, subgroup discovery being an example. A typical result in rule mining is that there are too low or too many rules in the resulting ruleset. Analysts must usually iterate 5–15 times to get a reasonable number of rules. Inspired by research in a similar area of frequent itemsets to simplify input and parameter-free frequent itemsets, we have proposed a novel algorithm that finds rules based not on parameters like support and confidence but the best rules by a given range of required rule count in output. We propose this algorithm for couples of rules – SD4ft-Miner procedure and benefits from a brand new implementation of methods of mechanizing hypothesis formation in Python called Cleverminer that allows easy implementation of this algorithm. We have verified the algorithm by several applications on eight public data sets. Our original case was a case study, and it was also the reason why we developed the algorithm. However, implementation is in Python, and the algorithm itself can be used on a broader class of methods in any language. The algorithm iterates quickly, in all experiments we needed a maximum of 10 iterations. Possible enhancements to this algorithm are also outlined.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 1

Similar content being viewed by others

Data Availability

Dataset used is a publicly accessible dataset referenced in the manuscript. The repository with detailed results and source code enabling replication of experiments is also publicly available.

Notes

  1. we use terms rule and SD4ft-rule interchangeably as this article is about SD4ft-Miner and rules it finds

  2. In the proposed algorithm, the maximum number of iterations is the parameter and is set to 100.

  3. no specification means subset 1–1 for nominal attributes and sequence 1–1 for ordinal attributes (note that from the rule-mining point of view, these two definitions are equivalent)

  4. Rules are displayed in order of how they are returned from CleverMiner package ver. 1.0.2, as this version currently does not provide how to order them and is supposed to do so by manual work in postprocessing.

  5. attributes without specification are again subsets 1–1 or sequences 1–1

References

Download references

Acknowledgements

Not Applicable.

Funding

No funding received. Authors received no financial support for the research and the authorship of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

Petr Máša: Algorithm described in Section 5, design of the system of analytic tasks, running analytic tasks, related work sections, editing, repository; Jan Rauch: design of the system of analytic tasks, related work sections, editing.

Corresponding author

Correspondence to Petr Máša.

Ethics declarations

Conflicts of interest

The authors declare no competing interests.

Ethics approval and consent to participate

Not Applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Máša, P., Rauch, J. A novel algorithm for mining couples of enhanced association rules based on the number of output couples and its application. J Intell Inf Syst (2023). https://doi.org/10.1007/s10844-023-00820-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10844-023-00820-1

Keywords

Navigation