The learning of action sequences through social transmission
- 609 Downloads
Previous empirical work on animal social learning has found that many species lack the ability to learn entire action sequences solely through reliance on social information. Conversely, acquiring action sequences through asocial learning can be difficult due to the large number of potential sequences arising from even a small number of base actions. In spite of this, several studies report that some primates use action sequences in the wild. We investigate how social information can be integrated with asocial learning to facilitate the learning of action sequences. We formalize this problem by examining how learners using temporal difference learning, a widely applicable model of reinforcement learning, can combine social cues with their own experiences to acquire action sequences. The learning problem is modeled as a Markov decision process. The learning of nettle processing by mountain gorillas serves as a focal example. Through simulations, we find that the social facilitation of component actions can combine with individual learning to facilitate the acquisition of action sequences. Our analysis illustrates that how even simple forms of social learning, combined with asocial learning, generate substantially faster learning of action sequences compared to asocial processes alone, and that the benefits of social information increase with the length of the action sequence and the number of base actions.
KeywordsSocial learning Sequence learning Temporal difference learning Markov decision process Chaining
The authors thank Richard Byrne for his very helpful comments. DC thanks Magnus Enquist and Johan Lind for countless inspiring conversations and in particular for pointing out the importance of sequence learning, and the potential of Markov decision processes for modeling such learning problems. This research was supported by a grant from The John Templeton Foundation.
Conflict of interest
The authors declare that they have no conflicts of interest.
This article does not contain any studies with human participants or animals performed by any of the authors.
- Boesch C, Boesch H (1982) Optimisation of nut-cracking with natural hammers by wild chimpanzees. Behaviour 88(3):265–286Google Scholar
- Boyd R, Richerson PJ (1985) Culture and the evolutionary process. University of Chicago Press, ChicagoGoogle Scholar
- Byrne RW (1999) Cognition in great ape ecology. Skill-learning ability opens up foraging opportunities. Symp Zool Soc Lond 72:333–350Google Scholar
- Enquist M, Eriksson K, Ghirlanda S (2007) Critical social learning: a solution to Rogers's paradox of nonadaptive culture. Am Anthropol 109(4):727–734Google Scholar
- Glimcher PW (2011) Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proc Natl Acad Sci USA 108(15):647–654Google Scholar
- Heyes CM, Galef BG (1996) Social learning in animals: the roots of culture. Elsevier, AmsterdamGoogle Scholar
- Laland KN, Galef BG (2009) The question of animal culture. Harvard University Press, CambridgeGoogle Scholar
- Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: Black AH, Prokasy WF (eds) Classical conditioning: current research and theory. Appleton-Century-Crofts, New York, pp 64–99Google Scholar
- Sutton RS, Barto AG (1990) Time-derivative models of Pavlovian reinforcement. In: Gabriel M, Moore J (eds) Learning and computational neuroscience: foundations of adaptive networks. MIT Press, Cambridge, pp 497-537Google Scholar
- Sutton RS, Barto AG (1998) Reinforcement learning: AN introduction, vol 1. Cambridge Univ Press, CambridgeGoogle Scholar
- Tan M (1993) Multi-agent reinforcement learning: independent vs. cooperative agents. In: Proceedings of the tenth international conference on machine learning, pp 330–337Google Scholar
- Zentall TR, Galef BG Jr (1988) Social learning: psychological and biological perspectives. Lawrence Erlbaum Associates, HillsdaleGoogle Scholar