Coarsest granularity-based optimal reduct using A* search

Bar, Abhimanyu; Kumar, Anil; Sai Prasad, P. S. V. S.

doi:10.1007/s41066-022-00313-6

Coarsest granularity-based optimal reduct using A^* search

Original Paper
Published: 08 March 2022

Volume 8, pages 45–66, (2023)
Cite this article

Granular Computing Aims and scope Submit manuscript

130 Accesses
2 Citations
Explore all metrics

Abstract

The optimal reduct computation problem aims to obtain the best reduct out of all possible reducts of a given decision system. In the rough set literature, two optimality criteria exist for computing an optimal reduct: shortest length based and coarsest granular space based. The coarsest granular space-based optimal reduct has the ability to induce a better generalizable classification model. The \(A^*RSOR\) is an existing \(A^*\) search-based optimal reduct computation algorithm that uses the coarsest granular space as an optimality criterion. This article proposes an improved coarsest granularity-based optimal reduct approach \(MA^*\_RSOR\) through analyzing the search process’s behaviour in \(A^*RSOR\) algorithm. To minimize the search space utilization and arrive at an optimal reduct in less time, suitable modifications are incorporated using the domain knowledge of rough set theory. The relevance of \(MA^*\_RSOR\) is demonstrated through theoretical analysis and comparative experimental validation with state-of-the-art algorithms. The experimental results with benchmark data sets established that \(MA^*\_RSOR\) achieves significant computational time gain (\(49-99\%\)) and space reduction (\(37-96\%\)) over \(A^*RSOR\). The \(MA^*\_RSOR\) could induce classification models with significantly better classification accuracies than state-of-the-art shortest length-based optimal/near-optimal reduct computation algorithms. In addition, a coefficient of variation based \(CV_{\text {NonCore}}\) heuristic is proposed for predicting when the \(MA^*\_RSOR\) algorithm is appropriate to use. Experimental results validate the relevance of the heuristic as its prediction turned out correctly in 8 out of 10 data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Approaches for coarsest granularity based near-optimal reduct computation

Article 06 June 2022

Finding Optimal Rough Set Reduct with A* Search Algorithm

Data-guided multi-granularity selector for attribute reduction

Article 06 September 2020

Availability of data and materials

Manuscript has no associated data.

Notes

Note: Without loss of generality, \(U/IND(\emptyset )\) is taken as \(\{U\}\). That means objects are not distinguishable in absence of any attributes information.

References

Abdi H (2010) Coefficient of variation. Encyclopedia Res Des 1:169–171
Google Scholar
Arel-Bundock V (2012) Rdatasets: An archive of datasets distributed with R. https://vincentarelbundock.github.io/Rdatasets/datasets.html
Bar A, Kumar A, Sai Prasad P (2019) Finding optimal rough set reduct with A* search algorithm. International Conference on Pattern Recognition and Machine Intelligence( PReMI 2019), vol 11941. Springer, Lecture Notes in Computer Science, pp 317–327
Barron A, Rissanen J, Yu B (1998) The minimum description length principle in coding and modeling. IEEE Trans Inf Theory 44(6):2743–2760
Article MathSciNet MATH Google Scholar
Bazan JG, Szczuka M (2005) The rough set exploration system. In: Transactions on Rough Sets III. Springer, pp 37–56, https://www.mimuw.edu.pl/~szczuka/rses/get.html
Benouini R, Batioua I, Ezghari S et al (2020) Fast feature selection algorithm for neighborhood rough set model based on bucket and trie structures. Granular Comput 5:329–347
Article Google Scholar
Chen D, Zhao S, Zhang L et al (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24(11):2080–2093
Article Google Scholar
Chen Y, Zhu Q, Xu H (2015) Finding rough set reducts with fish swarm algorithm. Knowl-Based Syst 81:22–29
Article Google Scholar
Choromański M, Grześ T, Hońko P (2020) Breadth search strategies for finding minimal reducts: towards hardware implementation. Neural Comput Appl 32:14801–14816
Article Google Scholar
Chouchoulas A, Shen Q (2001) Rough set-aided keyword reduction for text categorization. Appl Artif Intell 15(9):843–873
Article Google Scholar
Dai J, Hu Q, Zhang J et al (2016) Attribute selection for partially labeled categorical data by rough set approach. IEEE Trans Cybernet 47(9):2460–2471
Article Google Scholar
Dash M, Liu H (1997) Feature selection for classification. Intell Data Anal 1(3):131–156
Article Google Scholar
Dua D, Graff C (2017) UCI machine learning repository. http://archive.ics.uci.edu/ml
Ferone A (2018) Feature selection based on composition of rough sets induced by feature granulation. Int J Approx Reason 101:276–292
Article MathSciNet MATH Google Scholar
Gao C, Lai Z, Zhou J et al (2018) Maximum decision entropy-based attribute reduction in decision-theoretic rough set model. Knowl-Based Syst 143:179–191
Article Google Scholar
Geng Z, Zhu Q (2009) Rough set-based heuristic hybrid recognizer and its application in fault diagnosis. Expert Syst Appl 36(2):2711–2718
Article Google Scholar
Grzymala-Busse JW (1992) LERS-a system for learning from examples based on rough sets. In: Intelligent decision support, vol 11. Springer, pp 3–18
Han J, Hu X, Lin TY (2004) Feature subset selection based on relative dependency between attributes. In: Neuromuscular junction. Handbook of experimental pharmacology, vol 3066. Springer, pp 176–185
Google Scholar
Hart PE, Nilsson NJ, Raphael B (1968) A formal basis for the heuristic determination of minimum cost paths. IEEE Trans Syst Sci Cybernet 4(2):100–107
Article Google Scholar
Hart PE, Nilsson NJ, Raphael B (1972) Correction to a formal basis for the heuristic determination of minimum cost paths. SIGART Bull 37:28–29
Article Google Scholar
Hu X (1995) Knowledge discovery in databases: an attribute-oriented rough set approach. PhD thesis, University of Regina Regina, Canada
Hu K, Diao L, Lu Y, et al (2000) A heuristic optimal reduct algorithm. In: International Conference on Intelligent Data Engineering and Automated Learning, vol 1983. Springer, pp 139–144
Jensen R, Shen Q (2003) Finding rough set reducts with ant colony optimization. In: Proceedings of the 2003 UK workshop on computational intelligence, vol 1. Springer, pp 15–22
Google Scholar
Jensen R, Shen Q (2004) Semantics-preserving dimensionality reduction: rough and fuzzy-rough-based approaches. IEEE Trans Knowl Data Eng 16(12):1457–1471
Article Google Scholar
Karpinski M, Schudy W (2011) Approximation schemes for the betweenness problem in tournaments and related ranking problems. In: Approximation, Randomization, and combinatorial optimization. Algorithms and Techniques, vol 6845. Springer, pp 277–288
Kumar A, Prasad PS (2020) Scalable fuzzy rough set reduct computation using fuzzy min-max neural network preprocessing. IEEE Trans Fuzzy Syst 28(5):953–964
Article Google Scholar
Lavangnananda K, Chattanachot S (2017) Study of discretization methods in classification. In: 2017 9th International Conference on Knowledge and Smart Technology (KST), vol 16774343. IEEE, pp 50–55
Li Y, Shiu SCK, Pal SK et al (2006) A rough set-based case-based reasoner for text categorization. Int J Approx Reason 41(2):229–255
Article Google Scholar
Li F, Miao D, Pedrycz W (2017) Granular multi-label feature selection based on mutual information. Pattern Recogn 67:410–423
Article Google Scholar
Li W, Jia X, Wang L et al (2019) Multi-objective attribute reduction in three-way decision-theoretic rough set model. Int J Approx Reason 105:327–341
Article MathSciNet MATH Google Scholar
Liang D, Pedrycz W, Liu D et al (2015) Three-way decisions based on decision-theoretic rough sets under linguistic assessment with the aid of group decision making. Appl Soft Comput 29:256–269
Article Google Scholar
Liu H, Hussain F, Tan CL et al (2002) Discretization:an enabling technique. Data Min Knowl Disc 6(4):393–423
Article MathSciNet Google Scholar
Mafarja MM, Mirjalili S (2019) Hybrid binary ant lion optimizer with rough set and approximate entropy reducts for feature selection. Soft Comput 23(15):6249–6265
Article Google Scholar
Mahajan P, Kandwal R, Vijay R (2012) Rough set approach in machine learning: a review. Int J Comput Appl 56(10):1–13
Google Scholar
Mitra S, Mitra M, Chaudhuri BB (2006) A rough-set-based inference engine for ecg classification. IEEE Trans Instrum Meas 55(6):2198–2206
Article MATH Google Scholar
Moshkov M, Zielosko B (2011) Combinatorial machine learning: a rough set approach, vol 360. Springer Science & Business Media
MATH Google Scholar
Pawlak Z (1982) Rough sets. Int J Comput Inf Sci 11(5):341–356
Article MATH Google Scholar
Pawlak Z (1991) Rough Sets: theoretical aspects of reasoning about data, vol 9. Springer Science & Business Media
Book MATH Google Scholar
Pawlak Z, Skowron A (2007) Rough sets and boolean reasoning. Inf Sci 177(1):41–73
Article MathSciNet MATH Google Scholar
Qian J, Liu C, Yue X (2019) Multigranulation sequential three-way decisions based on multiple thresholds. Int J Approx Reason 105:396–416
Article MathSciNet MATH Google Scholar
Raza MS, Qamar U (2018) Feature selection using rough set-based direct dependency calculation by avoiding the positive region. Int J Approx Reason 92:175–197
Article MathSciNet MATH Google Scholar
Rodríguez-Diez V, Martínez-Trinidad JF, Carrasco-Ochoa JA et al (2020) Minreduct: a new algorithm for computing the shortest reducts. Pattern Recogn Lett 138:177–184
Article Google Scholar
Sai Prasad P, Rao CR (2011) Extensions to iquick reduct. In: International Workshop on Multi-disciplinary Trends in Artificial Intelligence MIWAI’11, vol 7080. Springer, pp 351–362
Shi Y, Huang Y, Wang C et al (2019) Attribute reduction based on the boolean matrix. Granular Comput 4(3):313–322
Article Google Scholar
Skowron A, Rauszer C (1992) The discernibility matrices and functions in information systems. In: Intelligent decision support, vol 11. Springer, pp 331–362
Starzyk J, Nelson DE, Sturtz K (1999) Reduct generation in information systems. Bull Int Rough Set Soc 3(1/2):19–22
MATH Google Scholar
Starzyk JA, Nelson DE, Sturtz K (2000) A mathematical foundation for improved reduct generation in information systems. Knowl Inf Syst 2(2):131–146. https://doi.org/10.1007/s101150050007
Article MATH Google Scholar
Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24(6):833–849
Article MATH Google Scholar
Tan A, Li J, Lin Y et al (2015) Matrix-based set approximations and reductions in covering decision information systems. Int J Approx Reason 59:68–80
Article MathSciNet MATH Google Scholar
Vitányi PM, Li M (2000) Minimum description length induction, bayesianism, and kolmogorov complexity. IEEE Trans Inf Theory 46(2):446–464
Article MathSciNet MATH Google Scholar
Wang X, Yang J, Teng X et al (2007) Feature selection based on rough sets and particle swarm optimization. Pattern Recogn Lett 28(4):459–471
Article Google Scholar
Wang C, Shi Y, Fan X et al (2019) Attribute reduction based on k-nearest neighborhood rough sets. Int J Approx Reason 106:18–31
Article MathSciNet MATH Google Scholar
Wroblewski J (1995) Finding minimal reducts using genetic algorithms. In: Proccedings of the second annual join conference on infromation science, pp 186–189
Yao Y, Zhao Y, Wang J (2008) On reduct construction algorithms. Transactions on Computational Science II. Springer, Berlin Heidelberg, Berlin, Heidelberg, pp 100–117
Zhan J, Ali MI, Mehmood N (2017) On a novel uncertain soft set model: Z-soft fuzzy rough set model and corresponding decision making methods. Appl Soft Comput 56:446–457
Article Google Scholar
Zhang K, Zhan J, Wu WZ (2020) Novel fuzzy rough set models and corresponding applications to multi-criteria decision-making. Fuzzy Sets Syst 383:92–126
Article MathSciNet MATH Google Scholar
Zhang J, Wang J, Li D, et al (2003) A new heuristic reduct algorithm base on rough sets theory. In: International Conference on Web-Age Information Management, Springer, pp 247–253
Zhao J, Jm Liang, Zn Dong et al (2020) Accelerating information entropy-based feature selection using rough set theory with classified nested equivalence classes. Pattern Recogn 107(107):517
Google Scholar
Ziarko W (2002) Rough set approaches for discovery of rules and attribute dependencies. Handbook of data mining and knowledge discovery pp 328–338

Download references

Acknowledgements

This research acknowledges the financial support of UoH-IoE by MHRD (F11/9/2019-U3(A)). The first author acknowledges the support of the Senior Research Fellowship from the Council of Scientific and Industrial Research (CSIR) (09/414(1117)/2016-EMR-I), Ministry of Science and Technology, Government of India.

Author information

Authors and Affiliations

School of Computer and Information Sciences, University of Hyderabad, Central University P.O, Hyderabad, Telangana, 500046, India
Abhimanyu Bar, Anil Kumar & P. S. V. S. Sai Prasad

Authors

Abhimanyu Bar
View author publications
You can also search for this author in PubMed Google Scholar
Anil Kumar
View author publications
You can also search for this author in PubMed Google Scholar
P. S. V. S. Sai Prasad
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Abhimanyu Bar, Anil Kumar or P. S. V. S. Sai Prasad.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bar, A., Kumar, A. & Sai Prasad, P.S.V.S. Coarsest granularity-based optimal reduct using A^* search. Granul. Comput. 8, 45–66 (2023). https://doi.org/10.1007/s41066-022-00313-6

Download citation

Received: 23 November 2021
Accepted: 13 February 2022
Published: 08 March 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s41066-022-00313-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Coarsest granularity-based optimal reduct using A^* search

Abstract

Access this article

Similar content being viewed by others

Approaches for coarsest granularity based near-optimal reduct computation

Finding Optimal Rough Set Reduct with A* Search Algorithm

Data-guided multi-granularity selector for attribute reduction

Availability of data and materials

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Coarsest granularity-based optimal reduct using A* search

Abstract

Access this article

Similar content being viewed by others

Approaches for coarsest granularity based near-optimal reduct computation

Finding Optimal Rough Set Reduct with A* Search Algorithm

Data-guided multi-granularity selector for attribute reduction

Availability of data and materials

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Coarsest granularity-based optimal reduct using A^* search