A Mixed Learning Strategy for Finding Typical Testors in Large Datasets

González-Guevara, Víctor Iván; Godoy-Calderon, Salvador; Alba-Cabrera, Eduardo; Ibarra-Fiallo, Julio

doi:10.1007/978-3-319-25751-8_86

A Mixed Learning Strategy for Finding Typical Testors in Large Datasets

Víctor Iván González-Guevara¹⁵,
Salvador Godoy-Calderon¹⁵,
Eduardo Alba-Cabrera¹⁶ &
…
Julio Ibarra-Fiallo¹⁶

Conference paper
First Online: 25 October 2015

2393 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9423))

Abstract

This paper presents a mixed, global and local, learning strategy for finding typical testors in large datasets. The goal of the proposed strategy is to allow any search algorithm to achieve the most significant reduction possible in the search space of a typical testor-finding problem. The strategy is based on a trivial classifier which partitions the search space into four distinct classes and allows the assessment of each feature subset within it. Each class is handled by slightly different learning actions, and induces a different reduction in the search-space of a problem. Any typical testor-finding algorithm, whether deterministic or metaheuristc, can be adapted to incorporate the proposed strategy and can take advantage of the learned information in diverse manners.

Mexican authors wish to thank CONACyT and SIP-IPN for their support of this research, particularly through grant SIP-20151393. Also, Ecuatorian authors wish to thank the financial support received from USFQ-Small Grants.

Download to read the full chapter text

Chapter PDF

References

Quinlan, J. R.: C4.5: Programs for Machine Learning. Published by Morgan Kaufmann Publishers Inc. (1993)
Google Scholar
Mannila, H., Toivonen, H., Verkamo, A.: Discovery of Frequent Episodes in Event Sequences. Data Min. Knowl. Discov. 1(3), 258–289 (1997)
Google Scholar
Buddeewong, S., Kreesuradej, W.: A new association rule-based text classifier algorithm. In: Proceedings of the 17th IEEE International Conference on Tools with Artificial Intelligence, pp. 684–685 (2005)
Google Scholar
Xei, F., Wu, X., Zhu, X.: Document-Specific Keyphrase Extraction Using Sequential Patterns with Wildcards. In: Proceedings of the IEEE 14th International Conference on Data Mining (2014)
Google Scholar
Haleem, H., Kumar, P., Beg, S.: Novel frequent sequential patterns based probabilistic model for effective classification of web documents. In: 2014 International Conference on Computer and Communication Technology (ICCCT), pp. 361–371 (2014)
Google Scholar
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: Proceeding in the 5th International Conference Extending Database Technology, pp. 3–17 (1996)
Google Scholar
Pei, J., Han, J., Mortazavi-asl, B., Pinto, H., Chen, Q., Dayal U., Hsu, M.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth. In: Proceedings of the 17th International Conference on Data Engineering, pp. 215–224 (2001)
Google Scholar
Yang, Z., Wang, Y., Kitsuregawa, M.: LAPIN: effective sequential pattern mining algorithms by last position induction for dense databases. In: Kotagiri, R., Radha Krishna, P., Mohania, M., Nantajeewarawat, E. (eds.) DASFAA 2007. LNCS, vol. 4443, pp. 1020–1023. Springer, Heidelberg (2007)
Chapter Google Scholar
Gouda, K., Hassaan, M., Zaki, M.J.: Prism: An effective approach for frequent sequence mining via prime-block encoding. J. Comput. Syst. Sci. 76(1), 88–102 (2010)
Article MathSciNet MATH Google Scholar
Steinbach, M., Kumar, V.: Generalizing the Notion of Confidence. In: Proceedings of the ICDM, pp. 402–409 (2005)
Google Scholar
Wang, Y., Xin, Q., Coenen, F.: Hybrid Rule Ordering in Classification Association Rule Mining. Trans. MLDM 1(1), 1–15 (2008)
Google Scholar
Hernández, R., Carrasco, J.A., Martínez, JFco, Hernández, J.: Combining Hybrid Rule Ordering Strategies Based on Netconf and a Novel Satisfaction Mechanism for CAR-based Classifiers. Intell. Data Anal. 18(6S), S89–S100 (2014)
Google Scholar
Ahn, K.I., Kim, J.Y.: Efficient Mining of Frequent Itemsets and a Measure of Interest for Association Rule Mining. Information and Knowledge Management 3(3), 245–257 (2004)
Article MathSciNet Google Scholar
Frank, E., Witten, I. H.: Generating Accurate Rule Sets Without Global Optimization. In: Proceedings of the 15th International Conference on Machine Learning, pp. 144–151 (1998)
Google Scholar
Cortes, C., Vapnik, V.: Support-Vector Networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Hernández, R., Carrasco, J.A., Martínez, JFco, Hernández, J.: CAR-NF: A classifier based on specific rules with high netconf. Intell. Data Anal. 16(1), 49–68 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Instituto Politécnico Nacional, Centro de Investigación en Computación (CIC), Av. Luis Enrique Erro S/N, Unidad Profesional Adolfo López Mateos, Zacatenco, Delegación Gustavo A. Madero, 07738, Ciudad de Mexico, Mexico
Víctor Iván González-Guevara & Salvador Godoy-Calderon
Colegio de Ciencias e Ingenierías, Departamento de Matemáticas, Universidad San Francisco de Quito (USFQ), Diego de Robles y Vía Interoceanica, Quito, Ecuador
Eduardo Alba-Cabrera & Julio Ibarra-Fiallo

Authors

Víctor Iván González-Guevara
View author publications
You can also search for this author in PubMed Google Scholar
Salvador Godoy-Calderon
View author publications
You can also search for this author in PubMed Google Scholar
Eduardo Alba-Cabrera
View author publications
You can also search for this author in PubMed Google Scholar
Julio Ibarra-Fiallo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Eduardo Alba-Cabrera .

Editor information

Editors and Affiliations

Univ. Católica del Uruguay, Montevideo, Uruguay
Alvaro Pardo
University of Surrey, Guildford, United Kingdom
Josef Kittler

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

González-Guevara, V.I., Godoy-Calderon, S., Alba-Cabrera, E., Ibarra-Fiallo, J. (2015). A Mixed Learning Strategy for Finding Typical Testors in Large Datasets. In: Pardo, A., Kittler, J. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2015. Lecture Notes in Computer Science(), vol 9423. Springer, Cham. https://doi.org/10.1007/978-3-319-25751-8_86

Download citation

DOI: https://doi.org/10.1007/978-3-319-25751-8_86
Published: 25 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25750-1
Online ISBN: 978-3-319-25751-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)