Model-Free and Model-Based Active Learning for Regression

  • Conference paper
Advances in Computational Intelligence Systems

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 513)

Abstract

Training machine learning models often requires large labelled datasets, which can be both expensive and time-consuming to obtain. Active learning aims to selectively choose which data is labelled in order to minimize the total number of labels required to train an effective model. This paper compares model-free and model-based approaches to active learning for regression, finding that model-free approaches, in addition to being less computationally intensive to implement, are more effective in improving the performance of linear regressions than model-based alternatives.
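As an illustration of what a model-free selection strategy can look like, the sketch below picks the next instance to label using only distances in feature space, without consulting any trained regressor. The abstract does not say which strategies the paper evaluates, so this is an assumed example (greedy farthest-first selection), and the names `select_farthest`, `active_learning_loop`, and `oracle` are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_farthest(pool, labelled):
    """Model-free selection (an assumed strategy, not taken from the paper).

    Returns the index of the unlabelled pool instance whose nearest
    labelled neighbour is farthest away, or None if nothing is unlabelled.
    `labelled` maps pool indices to their known target values.
    """
    best_idx, best_dist = None, -1.0
    for i, x in enumerate(pool):
        if i in labelled:
            continue
        # Distance to the closest already-labelled instance.
        d = min(euclidean(x, pool[j]) for j in labelled)
        if d > best_dist:
            best_idx, best_dist = i, d
    return best_idx


def active_learning_loop(pool, oracle, seed_idx, budget):
    """Request up to `budget` labels from `oracle`, starting from one seed."""
    labelled = {seed_idx: oracle(pool[seed_idx])}
    for _ in range(budget):
        idx = select_farthest(pool, labelled)
        if idx is None:  # every instance is already labelled
            break
        labelled[idx] = oracle(pool[idx])
    return labelled


if __name__ == "__main__":
    pool = [(0.0,), (1.0,), (2.0,), (10.0,)]
    # Hypothetical oracle standing in for a human annotator.
    labels = active_learning_loop(pool, lambda x: 2.0 * x[0], seed_idx=0, budget=2)
    print(labels)
```

A model-based alternative would replace `select_farthest` with a criterion that depends on the current regressor, for example disagreement among a committee of models. That requires retraining at each step, which is one reason model-free selection tends to be cheaper to run.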

Notes

  1. https://archive.ics.uci.edu/ml/datasets.html

  2. https://gist.github.com/joneill87/b19ccf79df3195bda4960210ee14c96d

Author information

Corresponding author

Correspondence to Jack O’Neill.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

O’Neill, J., Delany, S.J., Mac Namee, B. (2017). Model-Free and Model-Based Active Learning for Regression. In: Angelov, P., Gegov, A., Jayne, C., Shen, Q. (eds) Advances in Computational Intelligence Systems. Advances in Intelligent Systems and Computing, vol 513. Springer, Cham. https://doi.org/10.1007/978-3-319-46562-3_24

  • DOI: https://doi.org/10.1007/978-3-319-46562-3_24

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46561-6

  • Online ISBN: 978-3-319-46562-3

  • eBook Packages: Engineering, Engineering (R0)
