Skip to main content

The AmTriangle Meta-Dataset for Playing with Machine Learning

  • Conference paper
  • First Online:
Perspectives and Trends in Education and Technology

Abstract

AmTriangle is a meta-dataset for machine learning (ML) that supports convenient and configurable dataset production in addition to a classifier that performs supervised learning. The idea is to have a never-ending source of supervised datasets, minimizing learning barriers to ML, for training models, and facilitating educational experiments and comparisons between different workflows. It is a “meta-dataset” because it is a solution to generate sets of samples (datasets), as many as desired, correctly classified. Each sample is a triangle, classified as “acute”, “obtuse” or “right”, according to trigonometry. Each generated dataset can be used to “teach-by-example” how to classify new samples, by different techniques, namely K-Nearest-Neighbors and neural networks. The triangles could be other analogous objects, other tuples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Alzubi, J., Nayyar, A., Kumar, A.: Machine learning from theory to algorithms: an overview. J. Phys: Conf. Ser. 1142, 012012 (2018). https://doi.org/10.1088/1742-6596/1142/1/012012

    Article  Google Scholar 

  2. Gangadhar, S., Shanta, R.: Chapter 8—Machine learning. Handbook Stat. 38, 197–228 (2018). https://doi.org/10.1016/bs.host.2018.07.004

  3. Jaime, G.C., Ryszard, S.M., Tom, M.M.: 1—an overview of machine learning. 3–23 (1983). https://doi.org/10.1016/B978-0-08-051054-5.50005-4

  4. Caruana, R., Niculescu-Mizil, A.: An empirical comparison of supervised learning algorithms. Icml 06, 161–168 (2006). https://doi.org/10.1145/1143844.1143865

  5. Dridi, S.: Supervised Learning—A Systematic Literature Review (2021)

    Google Scholar 

  6. Barlow, H.B.: Unsupervised learning. Neural Comput. 1(3), 295–311 (1989). https://doi.org/10.1162/neco.1989.1.3.295

    Article  Google Scholar 

  7. Ghahramani, Z.: Unsupervised Learning. In: Bousquet, O., von Luxburg, U., Rätsch, G. (eds.) Advanced lectures on machine learning: ML Summer schools 2003, Canberra, Australia, February 2–14, 2003, Tübingen, Germany, August 4–16, 2003, Revised Lectures, pp. 72–112. Springer, Berlin Heidelberg, Berlin, Heidelberg (2004)

    Chapter  Google Scholar 

  8. Insights, D.: The Fourth Industrial Revolution (2020)

    Google Scholar 

  9. Chui, M., Manyika, J., Miremadi, M.: Where machines could replace humans—and where they can't (yet) (2016)

    Google Scholar 

  10. Zhang, Z.: Introduction to machine learning: k-nearest neighbors. Ann Transl Med 4(11), 218–218 (2016). https://doi.org/10.21037/atm.2016.03.37

    Article  Google Scholar 

  11. Dietterich, T.: Overfitting and undercomputing in machine learning. ACM Comput. Surv. 27(3), 326–327 (1995). https://doi.org/10.1145/212094.212114

    Article  Google Scholar 

  12. Roelofs, R., Shankar, V., Recht, B., Fridovich-Keil, S., Hardt, M., Miller, J., Schmidt, L.: A meta-analysis of overfitting in machine learning. In: Wallach, H., Larochelle, H., Beygelzimer, A., Alché-Buc, F.d., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 32. Curran Associates, Inc. %U (2019). https://proceedings.neurips.cc/paper/2019/file/ee39e503b6bedf0c98c388b7e8589aca-Paper.pdf

  13. Fisher, R.A.: The use of multiple measurements in taxonomic problems. Ann. Eugen. 7(2), 179–188 (1936). https://doi.org/10.1111/j.1469-1809.1936.tb02137.x

    Article  Google Scholar 

  14. Wolberg, D.W.H., Street, W.N., Mangasarian, O.L.: Breast Cancer Wisconsin (Diagnostic) Data Set (1995)

    Google Scholar 

  15. Alwosheel, A., van Cranenburgh, S., Chorus, C.G.: Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis. J. Choice Model. 28, 167–182 (2018). https://doi.org/10.1016/j.jocm.2018.07.002

    Article  Google Scholar 

  16. Triantafillou, E., Zhu, T., Dumoulin, V., Lamblin, P., Evci, U., Xu, K., Goroshin, R., Gelada, C., Swersky, K., Manzagol, P.-A., Larochelle, H.: Meta-dataset: a dataset of datasets for learning to learn from few examples. In: International Conference on Learning Representations (2020)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Artur Marques .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Marques, A., Amorim Silva, R.d., Madeira, F. (2023). The AmTriangle Meta-Dataset for Playing with Machine Learning. In: Mesquita, A., Abreu, A., Carvalho, J.V., de Mello, C.H.P. (eds) Perspectives and Trends in Education and Technology . Smart Innovation, Systems and Technologies, vol 320. Springer, Singapore. https://doi.org/10.1007/978-981-19-6585-2_22

Download citation

Publish with us

Policies and ethics