Skip to main content

Meta-Learning - Concepts and Techniques

  • Chapter
  • First Online:
Data Mining and Knowledge Discovery Handbook

Summary

The field of meta-learning has as one of its primary goals the understanding of the interaction between the mechanism of learning and the concrete contexts in which that mechanism is applicable. The field has seen a continuous growth in the past years with interesting new developments in the construction of practical model-selection assistants, task-adaptive learners, and a solid conceptual framework. In this chapter we give an overview of different techniques necessary to build meta-learning systems. We begin by describing an idealized meta-learning architecture comprising a variety of relevant component techniques. We then look at how each technique has been studied and implemented by previous research. In addition we show how meta-learning has already been identified as an important component in real-world applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 349.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  • Aha D. W. Generalizing from Case Studies: A Case Study. Proceedings of the Ninth International Workshop on Machine Learning; 1-10, Morgan Kaufman, 1992.

    Google Scholar 

  • Ali K., Pazzani M. J. Error Reduction Through Learning Model Descriptions. Machine Learning, 24, 173-202, 1996.

    Google Scholar 

  • Andersen, P., Petersen, N.C. A Procedure for Ranking Efficient Units in Data Envelopment Analysis. Management Science, 39(10):1261-1264, 1993.

    Article  MATH  Google Scholar 

  • Baltes J. Case-Based Meta Learning: Sustained Learning Supported by a Dynamically Biased Version Space. Proceedings of the Machine Learning Workshop on Biases in Inductive Learning, 1992.

    Google Scholar 

  • Baxter, J. Theoretical Models of Learning to Learn. In Learning to Learn, Chapter 4, 71-94, MA: Kluwer Academic Publishers, 1998.

    Google Scholar 

  • Baxter, J. A Model of Inductive Learning Bias. Journal of Artificial Intelligence Research, 12: 149-198, 2000.

    MATH  MathSciNet  Google Scholar 

  • Bensusan, H. God Doesn’t Always Shave with Occam’s Razor – Learning When and How to Prune. In Proceedings of the Tenth European Conference on Machine Learning, 1998.

    Google Scholar 

  • Bensusan, H., Giraud-Carrier, C. Discovering Task Neighbourhoods Through Landmark Learning Performances. In Proceedings of the Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases, 2000.

    Google Scholar 

  • Bensusan H., Giraud-Carrier C., Kennedy C. J. A Higher-Order Approach to Meta-Learning. Eleventh European Conference on Machine Learning, Workshop on Meta-Learning: Building Automatic Advice Strategies for Model Selection and Method Combination, Barcelona, Spain. 2000.

    Google Scholar 

  • Berrer, H., Paterson, I., Keller, J. Evaluation of Machine-learning Algorithm Ranking Advisors. In Proceedings of the PKDD-2000 Workshop on Data-Mining, Decision Support, Meta-Learning and ILP: Forum for Practical Problem Presentation and Prospective Solutions, 2000.

    Google Scholar 

  • Brazdil P. Data Transformation and Model Selection by Experimentation and Meta-Learning. Proceedings of the ECML-98 Workshop on Upgrading Learning to Meta-Level: Model Selection and Data Transformation, 11-17, Technical University of Chemnitz, 1998.

    Google Scholar 

  • Brazdil, P., Soares, C. A Comparison of Ranking Methods for Classification Algorithm Selection. In Proceedings of the Twelfth European Conference on Machine Learning, 2000.

    Google Scholar 

  • Brazdil, P., Soares, C., Pinto da Costa, J. Ranking Learning Algorithms: Using IBL and Meta-Learning on Accuracy and Time Results. Machine Learning, 50(3): 251-277, 2003.

    Article  MATH  Google Scholar 

  • Breiman, L. Stacked Regressions. Machine Learning, 24:49-64, 1996.

    MATH  MathSciNet  Google Scholar 

  • Brodley, C. Addressing the Selective Superiority Problem: Automatic Algorithm/Model Class Selection. Proceedings of the Tenth International Conference on Machine Learning, 17-24, San Mateo, CA, Morgan Kaufman, 1993.

    Google Scholar 

  • Brodley, C. Recursive Automatic Bias Selection for Classifier Construction. Machine Learning, 20, 1994.

    Google Scholar 

  • Brodley C., Lane T. Creating and Exploiting Coverage and Diversity. Proceedings of the AAAI-96 Workshop on Integrating Multiple Learned Models, 8-14, Portland, Oregon, 1996.

    Google Scholar 

  • Caruana, R. Multitask Learning. Second Special Issue on Inductive Transfer. Machine Learning, 28: 41-75, 1997.

    Google Scholar 

  • Chan P., Stolfo S. Experiments on Multistrategy Learning by Meta-Learning. Proceedings of the International Conference on Information Knowledge Management, 314-323, 1993.

    Google Scholar 

  • Chan, P., Stolfo, S. On the Accuracy of Meta-Learning for Scalable Data Mining. Journal of Intelligent Information Systems, 8:3-28, 1996.

    Google Scholar 

  • Chan P., Stolfo S. On the Accuracy of Meta-Learning for Scalable Data Mining. Journal of Intelligent Integration of Information, Ed. L. Kerschberg, 1998.

    Google Scholar 

  • DesJardins M., Gordon D. F. Evaluation and Selection of Biases in Machine Learning. Machine Learning, 20, 5-22, 1995.

    Google Scholar 

  • Dzeroski, Z. Is Combining Classifiers Better than Selecting the Best One? Proceedings of the Nineteenth International Conference on Machine Learning, pp 123-130, San Francisco, CA, Morgan Kaufmann, 2002.

    Google Scholar 

  • Engels, R., Theusinger, C. Using a Data Metric for Offering Preprocessing Advice in Datamining Applications. In Proceedings of the Thirteenth European Conference on Artificial Intelligence, 1998.

    Google Scholar 

  • Freund, Y., Schapire, R. E. Experiments with a New Boosting Algorithm. In Proceedings of the 13th International Conference on Machine Learning, 148-156, Morgan Kaufmann, 1996.

    Google Scholar 

  • Friedman, J., Hastie, T., Tibshirani, R. Additive Logistic Regression: A Statistical View of Boosting. Annals of Statistics 28: 337-387, 2000.

    Article  MATH  MathSciNet  Google Scholar 

  • Fürnkranz, J., Petrak J. An Evaluation of Landmarking Variants, in C. Giraud-Carrier, N. Lavrac, Steve Moyle, and B. Kavsek, editors, Working Notes of the ECML/PKDD 2000 Workshop on Integrating Aspects of Data Mining, Decision Support and Meta-Learning, 2001.

    Google Scholar 

  • Gama, J., Brazdil, P. A Characterization of Classification Algorithms. Proceedings of the Seventh Portuguese Conference on Artificial Intelligence, EPIA, 189-200, Funchal, Madeira Island, Portugal, 1995.

    Google Scholar 

  • Gama, J., Brazdil P. Cascade Generalization, Machine Learning,41(3), Kluwer, 2000.

    Google Scholar 

  • Giraud-Carrier, C. Beyond Predictive Accuracy: What? Proceedings of the ECML-98Workshop on Upgrading Learning to Meta-Level: Model Selection and Data Transformation, 78-85, Technical University of Chemnitz, 1998.

    Google Scholar 

  • Giraud-Carrier, C., Vilalta, R., Brazdil, P. Introduction to the Special Issue on Meta-Learning. Machine Learning, 54: 187-193, 2004.

    Article  Google Scholar 

  • Gordon D. Perlis D. Explicitly Biased Generalization. Computational Intelligence, 5, 67-81, 1989.

    Article  Google Scholar 

  • Gordon D. F. Active Bias Adjustment for Incremental, Supervised Concept Learning. PhD Thesis, University of Maryland, 1990.

    Google Scholar 

  • Hastie, T., Tibshirani, R., Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer Series, 2001.

    Google Scholar 

  • Hilario, M., Kalousis, A. Building Algorithm Profiles for Prior Model Selection in Knowledge Discovery Systems. Engineering Intelligent Systems, 8(2), 2000.

    Google Scholar 

  • Keller, J., Holzer, I., Silvery, S. Using Data Envelopment Analysis and Cased-based Reasoning Techniques for Knowledge-based Engine-intake Port Design. In Proceedings of the Twelfth International Conference on Engineering Design, 1999.

    Google Scholar 

  • Keller, J., Paterson, I., Berrer, H. An Integrated Concept for Multi-Criteria-Ranking of Data-Mining Algorithms. Eleventh European Conference on Machine Learning,Workshop on Meta-Learning: Building Automatic Advice Strategies for Model Selection and Method Combination, Barcelona, Spain, 2000.

    Google Scholar 

  • Merz C. Dynamic Learning Bias Selection. Preliminary papers of the Fifth International Workshop on Artificial Intelligence and Statistics, 386-395, Florida, 1995A.

    Google Scholar 

  • Merz C. Dynamical Selection of Learning Algorithms. Learning from Data: Artificial Intelligence and Statistics, D. Fisher and H. J. Lenz (Eds.), Springer-Verlag, 1995B.

    Google Scholar 

  • Metal. A Meta-Learning Assistant for Providing User Support in Machine Learning and Data Mining, 1998.

    Google Scholar 

  • Michie, D., Spiegelhalter, D. J., Taylor, C.C. Machine Learning, Neural and Statistical Classification. England: Ellis Horwood, 1994.

    MATH  Google Scholar 

  • Nakhaeizadeh, G., Schnabel, A. Development of Multi-criteria Metrics for Evaluation of Data-mining Algorithms. In Proceedings of the Third International Conference on Knowledge Discovery and Data-Mining, 1997.

    Google Scholar 

  • Paterson, I. New Models for Data Envelopment Analysis, Measuring Efficiency with the VRS Frontier. Economics Series No. 84, Institute for Advanced Studies, Vienna, 2000.

    Google Scholar 

  • Peng, Y., Flach, P., Brazdil, P., Soares, C. Decision Tree-Based Characterization for Meta-Learning. In: ECML/PKDD’02 Workshop on Integration and Collaboration Aspects of Data Mining, Decision Support and Meta-Learning, 111-122. University of Helsinki, 2002.

    Google Scholar 

  • Pfahringer, B., Bensusan, H., Giraud-Carrier, C. Meta-learning by Landmarking Various Learning Algorithms. In Proceedings of the Seventeenth International Conference on Machine Learning, 2000.

    Google Scholar 

  • Pratt, L., Thrun, S. Second Special Issue on Inductive Transfer. Machine Learning, 28, 1997.

    Google Scholar 

  • Pratt S., Jennings B. A Survey of Connectionist Network Reuse Through Transfer. In Learning to Learn, Chapter 2, 19-43, Kluwer Academic Publishers, MA, 1998.

    Google Scholar 

  • Rokach, L., Averbuch, M., and Maimon, O., Information retrieval system for medical narrative reports. Lecture notes in artificial intelligence, 3055. pp. 217-228, Springer-Verlag (2004).

    Google Scholar 

  • Schmidhuber J. Discovering Solutions with Low Kolmogorov Complexity and High Generalization Capability. Proceedings of the Twelve International Conference on Machine Learning, 488-49, Morgan Kaufman, 1995.

    Google Scholar 

  • Skalak, D. Prototype Selection for Composite Nearest Neighbor Classifiers. PhD thesis, University of Massachusetts, Amherst, 1997.

    Google Scholar 

  • Soares, C., Brazdil, P. Zoomed Ranking: Selection of Classification Algorithms Based on Relevant Performance Information. In Proceedings of the Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases, 2000.

    Google Scholar 

  • Soares, C., Petrak, J., Brazdil, P. Sampling-Based Relative Landmarks: Systematically Test-Driving Algorithms Before Choosing. Proceedings of the 10th Portuguese Conference on Artificial Intelligence, Springer, 2001.

    Google Scholar 

  • Sohn, S.Y. Meta Analysis of Classification Algorithms for Pattern Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(11): 1137-1144, 1999.

    Article  Google Scholar 

  • Thrun, S. Lifelong Learning Algorithms. In Learning to Learn, Chapter 8, 181-209, MA: Kluwer Academic Publishers, 1998.

    Google Scholar 

  • Ting, K. M., Witten I. H. Stacked generalization: When does it work?. In Proceedings of the 15th International Joint Conference on Artificial Intelligence, pp 866-873, Nagoya, Japan, Morgan Kaufmann, 1997.

    Google Scholar 

  • Todorovski, L., Dzeroski, S. Experiments in Meta-level Learning with ILP. In Proceedings of the Third European Conference on Principles and Practice of Knowledge Discovery in Databases, 1999.

    Google Scholar 

  • Todorovski, L., Dzeroski, S. Combining Multiple Models with Meta Decision Trees. In Proceedings of the Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases, 2000.

    Google Scholar 

  • Todorovski, L., Dzeroski, S. Combining Classifiers with Meta Decision Trees. Machine Learning 50 (3), 223-250, 2003.

    Article  MATH  Google Scholar 

  • Utgoff P. Shift of Bias for Inductive Concept Learning. In Michalski, R.S. et al (Ed), Machine Learning: An Artificial Intelligence Approach Vol. II, 107-148, Morgan Kaufman, California, 1986.

    Google Scholar 

  • Vilalta, R. Research Directions in Meta-Learning: Building Self-Adaptive Learners. International Conference on Artificial Intelligence, Las Vegas, Nevada, 2001.

    Google Scholar 

  • Vilalta, R., Drissi, Y. A Perspective View and Survey of Meta-Learning. Journal of Artificial Intelligence Review, 18 (2): 77-95, 2002.

    Article  Google Scholar 

  • Widmer, G. On-line Metalearning in Changing Contexts. MetaL(B) and MetaL(IB). In Proceedings of the Third International Workshop on Multistrategy Learning (MSL-96), 1996A.

    Google Scholar 

  • Widmer, G. Recognition and Exploitation of Contextual Clues via Incremental Meta-Learning. In Proceedings of the Thirteenth International Conference on Machine Learning (ICML-96), 1996B.

    Google Scholar 

  • Widmer, G. Tracking Context Changes through Meta-Learning. Machine Learning, 27(3):259-286, 1997.

    Article  Google Scholar 

  • Wolpert D. Stacked Generalization. Neural Networks, 5: 241-259, 1992.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Vilalta, R., Giraud-Carrier, C., Brazdil, P. (2009). Meta-Learning - Concepts and Techniques. In: Maimon, O., Rokach, L. (eds) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09823-4_36

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-09823-4_36

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-09822-7

  • Online ISBN: 978-0-387-09823-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics