1 Introduction

Agent Smith: “You can’t win, it’s pointless to keep fighting! Why, Mr. Anderson? Why do you persist?”

Neo: “Because I choose to.”

The Matrix Revolutions

The first official RoboCup was held in 1997, proposing a new benchmark for Artificial Intelligence (AI) and robotics. Incidentally, another classical AI challenge was successfully met in May 1997 when IBM Deep Blue defeated the human world champion in chess. By design, RoboCup and chess differ in a few key elements: environment (static vs dynamic), state change (turn-taking vs real-time), information accessibility (complete vs incomplete), sensor readings (symbolic vs non-symbolic), and control (central vs distributed) [1]. These differences are emphasised in the RoboCup 2D Soccer Simulation League [2], which quickly gained prominence, becoming one of the largest RoboCup leagues.

In this league, two teams of 12 fully autonomous software programs (called “agents”) play soccer in a two-dimensional virtual soccer stadium (11 player agents and 1 coach agent in each team), with no remote control. Each player agent receives relative and noisy input from its virtual sensors (visual, acoustic and physical) and may perform some basic actions in order to influence its environment, e.g., running, turning and kicking the ball. The coach agent receives perfect input but can communicate with the player agents only infrequently and through a fairly limited channel. The ability to simulate soccer matches without physical robots abstracts away low-level issues such as image processing and motor breakages, allowing teams to focus on the development of complex team behaviours and strategies for a larger number of autonomous agents [3, 4].

A simulated game lasts just over 10 minutes on average, and is played over a small network of computer workstations which execute the code in parallel. Each simulation step takes merely a tenth of a second, during which the entire sensory-motor cycle takes place within an agent: receiving new sensory input from the simulator, updating the internal memory, evaluating possible choices, and sending the chosen action back to the simulator. The main challenge for each agent is to derive the best possible action to execute at any specific time, while facing unexpected actions of the opposing agents.
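To make this cycle concrete, the following minimal Python sketch shows the shape of one agent’s sense-think-act loop. The WorldModel and choose_action placeholders are our illustrative assumptions; the s-expression messages and the default UDP port follow the simulator’s protocol.

```python
import socket

# Minimal sketch of one agent's sense-think-act cycle. WorldModel and
# choose_action are hypothetical placeholders; the s-expression messages
# follow the RoboCup soccer server's UDP protocol (default port 6000).

class WorldModel:
    """Hypothetical internal memory updated from noisy, relative percepts."""
    def update(self, message: str) -> None:
        self.last_message = message   # a real model parses (see ...) etc.

def choose_action(world: WorldModel) -> str:
    return "(turn 30)"                # placeholder: evaluate options here

def run_agent(team="ExampleTeam", host="localhost", port=6000):
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.sendto(f"(init {team} (version 15))".encode(), (host, port))
    world = WorldModel()
    while True:
        msg, server = sock.recvfrom(8192)   # (see ...), (hear ...), (sense_body ...)
        world.update(msg.decode())          # update beliefs from noisy input
        sock.sendto(choose_action(world).encode(), server)  # within the 100 ms cycle
```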

Over 20 years, the RoboCup community has developed the open-source 2D simulator and visualisation software which currently, with various packaged utilities and basic agent libraries, contains nearly a million lines of code. During this period, the League and the participating teams have undergone several transitions, each of which eventually raised the level of the agents’ intelligence and behavioural complexity. In this paper we attempt not only to trace the ten-year-long progress of our own team from its first implementation (Cyberoos; participated between 1998 and 2003) to the RoboCup-2016 champion team (Gliders; competed first in 2012), but also to put this trace in the context of the twenty-year-long evolution of the Sim2D League itself.

The conjecture we put forward is that the League has been developing as an ecosystem with an increasing complexity shaped by the different approaches taken by participating teams. Furthermore, this evolving ecosystem has experienced a series of salient transitions leading to the emergence of qualitatively new properties in the intelligence exhibited by the agents. By a transition we do not mean a mere extension of some simulated capabilities, such as the introduction of goalkeepers, heterogeneous player types, or a coach language. Instead, we associate a transition with a specific methodological advance which played the role of a disruptive innovation, with widespread consequences affecting the entire “ecosystem”: for example, the release of standard libraries. We use the term “disruptive innovation” in a broad sense, to indicate an innovation that creates a new ecosystem (by analogy with a new market or value network), eventually disrupting an existing system and displacing established structures and relationships.

2 A Simulated World

The foundation supporting the evolution of the League is undoubtedly the construction of the soccer server itself, providing a centralised world model with several key features, enhanced over the following years:

  • distributed client/server system running on a network and producing fragmented, localised and imprecise (noisy and latent) information about the environment (virtual soccer field) [5, 6];

  • concurrent communication with a number of autonomous agents [7];

  • heterogeneous sensory data (visual, auditory, kinetic) without a global vision, and limited range of basic commands/effectors (turn, kick, dash, \(\ldots \)) [8];

  • asynchronous perception-action activity and limited window of opportunity to perform an action [9];

  • autonomous decision-making under constraints enforced by teamwork (collaboration) and opponent (competition) [10];

  • conflicts between reactivity and deliberation [11].

The only restriction imposed from the outset was that participants should “never use central control mechanisms to control a team of agents” [12].

A crucial feature making this simulated world an evolving “ecosystem” is the availability of binaries (and sometimes the source code) of participating teams, contained within an online team repository. The repository is updated after each annual RoboCup competition, allowing the participants to improve their teams with respect to the top teams of the previous championships. These improvements diversify the teams’ functionality and explore the immense search-space of possible behaviours in the quest for optimal solutions. This process results in a co-evolution of the teams, raising the overall competition level.

3 Partial Automation of Development Efforts

“AT Humboldt” from Humboldt University, Germany, became the first champion of the League at RoboCup-1997 (Nagoya, Japan). The team used a combination of reactive and planning systems, successfully deploying its agents within the simulated world.

The following couple of years passed under the domination of the “CMUnited” team from Carnegie Mellon University (USA), which took the championship in 1998 (Paris, France) and 1999 (Stockholm, Sweden). One of the key reasons for this success was the development of several tools partially automating the overall effort, such as an offline agent training module, and layered disclosure: a technique for disclosing to a human designer the specific detailed reasons for an agent’s actions (at run-time or retroactively). Layered disclosure made it possible to inspect the details of an individual player’s decision-making process at any point [13], becoming in our view the first disruptive innovation in the League. Together with the offline agent training module, it clearly exemplified the power of automation in accelerating the development effort—precisely because it enabled the design effort to reach into a larger part of the search-space by encoding more diverse behaviours.
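The idea behind layered disclosure can be sketched very simply: the agent records the reasons for each action at several levels of detail, and a designer later “zooms in” on any decision. The levels, names and messages below are hypothetical, not CMUnited’s actual implementation.

```python
# Illustrative sketch of layered disclosure: reasons are logged per decision
# cycle at increasing levels of detail and can be inspected retroactively.

DISCLOSURE = []   # (cycle, level, message) records accumulated during a run

def disclose(cycle: int, level: int, message: str) -> None:
    DISCLOSURE.append((cycle, level, message))

def decide_action(cycle: int, options: list) -> str:
    disclose(cycle, 1, "decided to pass")                        # what was done
    disclose(cycle, 2, "teammate 9 open, pass value 0.82")       # why it was done
    disclose(cycle, 3, f"evaluated {len(options)} alternatives") # low-level detail
    return "(kick 80 15)"

def inspect(cycle: int, max_level: int) -> list:
    """Retroactively disclose the reasoning for a cycle, down to max_level."""
    return [m for (c, lvl, m) in DISCLOSURE if c == cycle and lvl <= max_level]

decide_action(100, ["pass to 9", "dribble", "clear"])
print(inspect(100, max_level=2))
```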

It is important to point out that there were other novelties introduced by CMUnited-98 and CMUnited-99, such as “single-channel, low-bandwidth communication”, “predictive, locally optimal skills (PLOS)”, and “strategic positioning using attraction and repulsion (SPAR)” [13], but we believe that it is the partial automation of the software development that became the disruptive innovation. It led to the widespread adoption of several debugging, visualising, log-playing, log-analysing, and machine learning tools.

4 Configurational Space

A number of new teams in 2000 utilised the code base of the 1999 champions, CMUnited-99: it provided code for interaction with the soccer server, skills, strategies, and debugging tools in a variety of programming languages [14]. The champion of RoboCup-2000, held in Melbourne, Australia, “FC Portugal” from University of Aveiro and University of Porto, extended this code base with a systematic approach to describing team strategy, the concepts of tactics, formations and player types, as well as situation-based strategic positioning and the dynamic positioning and role-exchange mechanisms [11, 15].

The generic innovation underlying these mechanisms was the ability to configure diverse single- and multi-agent behaviours. The range of these behaviours spans from active (ball possession) to strategic (ball recovery), from formations to tactics, and from individual skills to team strategies. Such diversity resulted in a considerable configurational flexibility displayed by the winning team, significantly increasing software development productivity and, more importantly, expanding the extent of the available behavioural search-space.

Not surprisingly, the expansion brought about by the larger configurational capacity was further exploited by the introduction of a standard coach language [16], enabling high-level coaching with explicit definition of formations, situations, player types and time periods, and resulting in a high-level coordination of team behaviour. In other words, a disruptive innovation was again delivered by a method which allowed teams to access deeper regions of the available search-space.
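The structure of such coach advice can be sketched as condition-directive rules. The sketch below is illustrative Python rather than the actual s-expression syntax of the standard coach language, and all names and states are our assumptions.

```python
from dataclasses import dataclass
from typing import Callable

# Schematic, hypothetical sketch of coach-style advice: each rule pairs a
# condition over the (fully observable) game state with a directive that is
# broadcast to a subset of players.

@dataclass
class CoachRule:
    condition: Callable[[dict], bool]   # predicate over the global game state
    directive: str                      # advice sent to the players
    players: set                        # uniform numbers the advice applies to

rules = [
    CoachRule(lambda s: s["play_mode"] == "before_kick_off",
              "assume 4-3-3 formation", set(range(1, 12))),
    CoachRule(lambda s: s["ball_x"] > 30.0,
              "push defensive line forward", {2, 3, 4, 5}),
]

def advise(state: dict):
    """Emit all directives whose conditions hold in the current state."""
    return [(r.players, r.directive) for r in rules if r.condition(state)]

print(advise({"play_mode": "play_on", "ball_x": 36.0}))
```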

Team “TsinghuAeolus” from Tsinghua University, China, which won the next two championships (RoboCup-2001 in Seattle, USA, and RoboCup-2002 in Fukuoka, Japan), focussed specifically on increasing the agents’ adaptability via a novel online advice-taking mechanism [17]. The configurational space was extended by a task-decomposition mechanism that assigned different parts of the task to different agents.

A major boost to the League was provided by the partial release of the source code of the next champion, team “UvA Trilearn” from University of Amsterdam, The Netherlands, which won RoboCup-2003 in Padua, Italy [18]. This release resulted in a standardisation of many low-level behaviours and the world model, effectively “locking in” the configurational space attained by that time, and motivating several teams to adopt UvA Trilearn as their code base.

5 Cyberoos: 1998–2003

At this stage we take a brief look at our first team, Cyberoos, which participated in RoboCup competitions between 1998 and 2003. The Cyberoos’98 team took \(3^{rd}\) place in the 1998 Pacific Rim RoboCup competition [19], while Cyberoos’2000 were \(4^{th}\) in the Open European RoboCup-2000 [9]. Despite these regional successes, the team’s best result on the world stage was a shared \(9^{th}\) place, which Cyberoos repeatedly took at the RoboCup competitions in 2000, 2001, 2002 and 2003, never reaching the quarter-finals [20,21,22,23]. In hindsight, the main reason for this lack of progress was overlooking the main tendency driving innovation in the League: the exploration of the search-space through the automation of development efforts and the standardisation of the configurational space.

Instead, the approach taken by Cyberoos focussed on self-organisation of emergent behaviour within a purely reactive agent architecture [21]. Only in later years did the Cyberoos architecture diversify to include semi-automated methods that quantified team performance in generic information-theoretic terms [22, 23]. This approach focussed on measuring the behavioural and belief dynamics in multi-agent systems, offering a possibility to evolve the team behaviour, optimised under a universal objective function, within the framework of information-driven self-organisation [24,25,26]. However, this framework only began to take functional shape a few years later, after the Cyberoos team effort stopped in 2003.

6 Search-Space Decomposition

The next decade of RoboCup championships witnessed an intense competition between three teams: “Brainstormers” from University of Osnabrück, Germany, “WrightEagle” from University of Science and Technology of China, and “HELIOS” from Fukuoka University and Osaka Prefecture University, Japan. Brainstormers became champions three times: in 2005 (Osaka, Japan), 2007 (Atlanta, USA), and 2008 (Suzhou, China); WrightEagle came first an incredible six times: in 2006 (Bremen, Germany), 2009 (Graz, Austria), 2011 (Istanbul, Turkey), 2013 (Eindhoven, The Netherlands), 2014 (João Pessoa, Brazil) and 2015 (Hefei, China); and HELIOS succeeded twice: in 2010 (Singapore) and 2012 (Mexico City, Mexico).

6.1 Machine Learning

Brainstormers’ effort focussed on reinforcement learning methods aiming at a universal machine learning system, where the agents learn to generate the appropriate behaviours to satisfy the most general objective of “winning the match”. Unfortunately, as has been acknowledged [27], “even from very optimistic complexity estimations it becomes obvious, that in the soccer simulation domain, both conventional solution methods and also advanced today’s reinforcement learning techniques come to their limit – there are more than \((10^8 \times 50)^{23}\) different states and more than \((1000)^{300}\) different policies per agent per half time”.

The high dimensionality of the search space motivated Brainstormers to use a multilayer perceptron neural network [27]: a feedforward artificial neural network which utilises a supervised learning technique called backpropagation for training the network. Rather than developing a universal learning system, Brainstormers succeeded in decomposing the problem into a number of individual behaviours (e.g., NeuroKick, NeuroIntercept, NeuroHassle) and tactics (e.g., NeuroAttack2vs2, NeuroAttack3vs4, NeuroAttack7vs8), learned with supervised learning techniques.
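For illustration, the following is a minimal numpy sketch of such a network: one hidden tanh layer trained by backpropagation on supervised (state, target-action) pairs. The dimensions and data are our assumptions, not the actual Brainstormers skill networks.

```python
import numpy as np

# Minimal multilayer perceptron sketch: an 8-d state (e.g. relative ball
# position/velocity) is mapped to a 2-d action (e.g. dash power, turn angle),
# trained by backpropagation of the squared error. Sizes are illustrative.

rng = np.random.default_rng(0)
W1, b1 = rng.normal(0, 0.1, (8, 16)), np.zeros(16)   # state -> hidden
W2, b2 = rng.normal(0, 0.1, (16, 2)), np.zeros(2)    # hidden -> action

def forward(x):
    h = np.tanh(x @ W1 + b1)
    return h, h @ W2 + b2

def train_step(x, y, lr=0.01):
    global W1, b1, W2, b2
    h, y_hat = forward(x)
    err = y_hat - y                      # gradient of squared error at output
    gW2, gb2 = np.outer(h, err), err
    dh = (W2 @ err) * (1 - h ** 2)       # backpropagate through tanh
    gW1, gb1 = np.outer(x, dh), dh
    W1 -= lr * gW1; b1 -= lr * gb1
    W2 -= lr * gW2; b2 -= lr * gb2

train_step(rng.normal(size=8), np.array([0.7, -0.2]))  # one supervised pair
```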

Recently, there has been renewed interest in backpropagation networks due to the successes of deep learning. In our view, the potential of reinforcement learning methods in RoboCup has not yet been fully realised, and deep learning may yet become a disruptive innovation for the Simulation League.

6.2 Automated Planning

The WrightEagle team addressed the challenges of (i) the high dimensionality of the search space and (ii) the limited computation time available in each decision cycle by using Markov Decision Processes (MDPs). The developed framework decomposes a given MDP into a set of sub-MDPs arranged over a hierarchical structure, and includes heuristics approximating online planning techniques [28]. The WrightEagle approach abandoned “the pursuit of absolute accuracy” and divided the continuous soccer field into a discrete space, further subdividing it into the players’ control areas according to geometric reachability. The resultant structure enables automated planning, accelerating the search process and extending the search depth [28].
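A crude sketch of the reachability-based discretisation might look as follows; the grid resolution and the distance-based reachability model are simplifying assumptions rather than WrightEagle’s actual construction.

```python
import numpy as np

# Sketch of discretising the field into players' control areas: each grid
# cell is assigned to whichever player can reach it first (here, crudely,
# the nearest player, i.e. equal maximum speeds). The resulting discrete
# structure is the kind of object over which online planning can search.

def control_areas(player_positions, cell=2.0, length=105.0, width=68.0):
    xs = np.arange(-length / 2, length / 2, cell)
    ys = np.arange(-width / 2, width / 2, cell)
    grid = np.zeros((len(xs), len(ys)), dtype=int)
    for i, x in enumerate(xs):
        for j, y in enumerate(ys):
            d = [np.hypot(x - px, y - py) for (px, py) in player_positions]
            grid[i, j] = int(np.argmin(d))   # index of the first-arriving player
    return grid

areas = control_areas([(-30.0, 0.0), (0.0, 10.0), (20.0, -5.0)])
```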

6.3 Opponent Modelling

The “HELIOS” team [29, 30] followed a similar path, targeting a decomposition of the problem space by developing an unsupervised learning method based on Constrained Delaunay Triangulation (CDT) [31]. A Delaunay triangulation for a set P of points in a plane is a triangulation \(\mathcal {D}(P)\) such that no point in P is inside the circumcircle of any triangle in \(\mathcal {D}(P)\) (in a CDT the circumcircle of some triangles may contain other triangles’ vertices). The method divides the soccer field into a set of triangles, which provide an input plane region for Neural Gas (NG) and Growing Neural Gas (GNG) methods. Specifically, the set \(P_b\) of N points represents chosen positions of the ball on the field, while the sets \(P_i\) describe the corresponding coordinates of each player \(1 \le i \le 11\), so that there is a bijective correspondence between \(P_b\) and each \(P_i\). Moreover, when the ball takes any position within a triangle of \(\mathcal{{D}}(P_b)\), each player’s position is computed in a congruent way within \(\mathcal{{D}}(P_i)\). During offline experiments, or even during a game, the behaviour of the opponent, for example the players’ motion, the directions of passes, and the overall team formations, can be mapped, analysed and categorised [29, 30].
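The ball-to-player mapping can be sketched with scipy’s Delaunay triangulation and barycentric interpolation; the anchor data below are illustrative toy values, and real formations use many more points.

```python
import numpy as np
from scipy.spatial import Delaunay

# Sketch of the formation mapping: anchor ball positions P_b correspond
# one-to-one with positions P_i of a given player; for an arbitrary ball
# location, the player's target is interpolated with the barycentric
# coordinates of the ball inside its triangle of D(P_b).

P_b = np.array([[0, 0], [52, 0], [-52, 0], [0, 34], [0, -34]], dtype=float)
P_i = np.array([[-10, 0], [20, 0], [-45, 0], [-5, 20], [-5, -20]], dtype=float)

tri = Delaunay(P_b)

def player_target(ball):
    s = int(tri.find_simplex(ball))     # triangle of D(P_b) containing the ball
    assert s != -1, "ball outside the triangulated region"
    T = tri.transform[s]
    b = T[:2] @ (ball - T[2])           # two barycentric coordinates
    w = np.append(b, 1.0 - b.sum())     # all three weights (sum to 1)
    return w @ P_i[tri.simplices[s]]    # congruent point within D(P_i)

print(player_target(np.array([10.0, 5.0])))
```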

It is evident that the main reason behind the recurrent successes of all three champion approaches is a dynamic decomposition of the problem space and its subsequent efficient exploration. This innovation goes beyond a simple standardisation of low-level behaviours within a rich but static configurational space, by employing automated learning and planning methods in a dynamic search.

7 Standardisation of “Hardware”

An influential disruptive innovation arrived in 2010, when the HELIOS team released a major update of their well-developed code base [32]:

  • librcsc-4.0.0: a base library for the RoboCup Soccer Simulator (RCSS);

  • agent2d-3.0.0: a base source code for a team;

  • soccerwindow2-5.0.0: a viewer and a visual debugger program for RCSS;

  • fedit2-2.0.0: a team formation editor for agent2d.

This resulted in nearly 80% of the League’s teams switching their code base to agent2d over the next few years. One may think of this phenomenon as a standardisation of the simulated hardware, freeing development effort to focus on higher-level tactical behaviours.

8 Gliders (2012–2016): Fusing Human Innovation and Artificial Evolution

We turn our attention to our champion team, Gliders, which won RoboCup-2016 (Leipzig, Germany) [33,34,35,36,37]. Gliders2012 and Gliders2013 reached the semi-finals of RoboCup in 2012 and 2013; Gliders2014 were runners-up in 2014; Gliders2015 finished third in RoboCup-2015; and Gliders2016 (a joint effort of the University of Sydney and CSIRO) became world champions in 2016.

The RoboCup-2016 competition included 18 teams from 9 countries: Australia, Brazil, China, Egypt, Germany, Iran, Japan, Portugal and Romania. Gliders2016 played 23 games over several rounds, winning 19, losing 2 and drawing 2, with a total score of 62:13, or 2.70:0.57 on average. In the two-game semi-final round, Gliders2016 defeated team CSU_Yunlu from Central South University (China), winning both games 2:1. The single-game final against team HELIOS2016 (Japan) went into extra time, and ended with Gliders2016 winning 2:1. The third place was taken by team Ri-one from Ritsumeikan University (Japan).

The 2016 competition also included an evaluation round, where all 18 participating teams played one game each against the champion of RoboCup-2015, team WrightEagle (China). Only two teams, the eventual finalists Gliders2016 and HELIOS2016, managed to win against the previous year champion, with Gliders defeating WrightEagle 1:0, and HELIOS producing the top score 2:1.

The Gliders team code is written in C++, using agent2d-3.1.1 [32] and fragments of the source code of team MarliK, released in 2012 [38].

In order to optimise the code, the Gliders development effort over the last five years involved human-based evolutionary computation (HBEC): a set of evolutionary computation techniques that rely on human innovation [39, 40].

In general, evolutionary algorithms search a large space of possible solutions that together form a population. Each solution is a “genotype”: a complex data structure representing the entire team behaviour encoded through a set of “design points”. A design point can be as simple as a single parameter (e.g., risk tolerance in making a pass), or as complicated as a multi-agent tactical behaviour (e.g., a conditional statement describing the situation when a defender moves forward to produce an offside trap).

Some design points are easy to vary. For instance, a formation defined via Delaunay Triangulations \(\mathcal{{D}}(P_b)\) and \(\mathcal{{D}}(P_i)\), \(1 \le i \le 11\), is an ordered list of coordinates, and varying and recombining such a list can be relatively easily automated. Other design points have an internal structure and are harder to permute. For example, a conditional statement describing a tactic has a condition and an action, encoded by numerous parameters such as positional coordinates, state information, and action details. Once such a statement (a design point) is created by human designers, its encoding can be used by evolutionary algorithms. However, the inception of the tactic needs creative innovation in the first place, justifying the hybrid HBEC approach.
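For illustration, such a genotype could be encoded as follows; this is a hypothetical Python sketch (the actual Gliders genotype is C++ code and far richer), mixing a simple tunable parameter with structured, human-designed tactics.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical encoding of HBEC design points: simple parameters sit
# alongside structured conditional tactics authored by human designers.

@dataclass
class ConditionalTactic:            # a structured design point
    condition: str                  # e.g. "opponent line high, ball in our half"
    action: str                     # e.g. "defender 4 steps up (offside trap)"
    params: dict                    # coordinates, thresholds, action details

@dataclass
class Genotype:
    pass_risk_tolerance: float                  # a simple design point
    formation: List[tuple]                      # Delaunay anchor coordinates
    tactics: List[ConditionalTactic] = field(default_factory=list)

g = Genotype(pass_risk_tolerance=0.35,
             formation=[(0.0, 0.0), (52.0, 0.0), (-52.0, 0.0)],
             tactics=[ConditionalTactic("opponent line high", "offside trap",
                                        {"trigger_x": -10.0})])
```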

The HBEC solutions representing team behaviours are evaluated with respect to their fitness, implemented as the average team performance, estimated over thousands of games for each generation played against a specific opponent. Some solutions are retained and recombined (i.e., the members of the population live) and some are removed (i.e., die) through selection. Importantly, the evolutionary process is carried out within different landscapes (one per known opponent), and typically results in different solutions evolved to outperform specific opponents. In order to maintain coherence of the resultant code, each design point is implemented with a logical mask switching the corresponding part of the genotype on and off for specific opponents (determined by their team names). This is loosely analogous to epigenetic programming [41].
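The opponent-specific masking can be sketched as follows; here a genotype is simplified to a list of {"mask", "value"} design points, and the operators and fitness function are placeholders (real fitness is the average performance over thousands of games).

```python
import random

# Sketch of the HBEC loop with opponent-specific masks: each design point is
# switched on only against opponents in its mask (an empty mask means
# "always on"), so one code base stays coherent while behaviours specialise.

def active(genotype, opponent):
    return [dp for dp in genotype if not dp["mask"] or opponent in dp["mask"]]

def recombine(a, b):
    return [random.choice(pair) for pair in zip(a, b)]

def mutate(g, rate=0.1):
    return [dict(dp, value=dp["value"] + random.gauss(0, rate)) for dp in g]

def evolve(population, opponent, fitness, generations=10, keep=0.5):
    for _ in range(generations):
        population.sort(key=lambda g: fitness(active(g, opponent)), reverse=True)
        survivors = population[: int(len(population) * keep)]   # these "live"
        children = [mutate(recombine(*random.sample(survivors, 2)))
                    for _ in range(len(population) - len(survivors))]
        population = survivors + children
    return population[0]   # best solution evolved for this particular opponent
```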

The approach is aimed at constantly improving performance from one artificial “generation” to another, with team designers innovating and recombining behaviours, while the fitness evaluation and the mutations are for the most part automated. The performance of Gliders was evaluated on several supercomputer clusters, on some days executing tens of thousands of experimental runs with different behaviour versions. It would be a fair estimate that the number of such trials is approaching 10 million. The overall search-space explored by the HBEC includes variations in both Gliders behaviour and opponent modelling. The approach incorporates disruptive innovations of the past years, including the standardisation of simulated “hardware” and several effective search-space decompositions.

Specific variations included (i) an action-dependent evaluation function, (ii) dynamic tactics with Voronoi diagrams, (iii) information dynamics, and (iv) bio-inspired collective behaviour.

The approach introduced in Gliders2012 [33] retained the advantages of a single evaluation metric (implemented in agent2d [32]), but diversified the evaluation by considering multiple points as desirable states. These desirable states for action-dependent evaluation are computed using Voronoi diagrams which underlie many tactical schemes of Gliders.
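As a simplified illustration, candidate open spots can be read off the Voronoi diagram of opponent positions, since Voronoi vertices are locally the points farthest from the nearest opponents; whether Gliders used exactly this construction is a simplification, and the positions below are toy data.

```python
import numpy as np
from scipy.spatial import Voronoi

# Sketch: Voronoi vertices of the opponents' positions are candidate
# "desirable states" (open spots for a pass or dribble), here filtered to
# the attacking half of the field. Data and bounds are illustrative.

opponents = np.array([[10, 5], [20, -8], [35, 0], [25, 15], [40, -12]], float)
vor = Voronoi(opponents)

def open_spots(x_min=0.0, x_max=52.5, y_abs=34.0):
    v = vor.vertices
    keep = (v[:, 0] >= x_min) & (v[:, 0] <= x_max) & (np.abs(v[:, 1]) <= y_abs)
    return v[keep]

print(open_spots())
```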

Starting in 2013, Gliders utilised information dynamics [42,43,44,45,46,47] for tactical analysis and opponent modelling. This analysis involves the computation of information transfer and storage, relating information transfer to the responsiveness of the players, and information storage within a team to the team’s rigidity and lack of tactical richness.
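For instance, a plug-in estimator of discrete, history-1 transfer entropy, a basic quantity in such an analysis, can be sketched as follows; binning real player trajectories into symbols is assumed to be done upstream.

```python
import numpy as np
from collections import Counter

# Plug-in estimator of transfer entropy T_{Y->X} with history length 1:
# how much knowing y_n improves the prediction of x_{n+1} beyond x_n alone.

def transfer_entropy(x, y):
    triples = Counter(zip(x[1:], x[:-1], y[:-1]))   # (x_{n+1}, x_n, y_n)
    pairs_xy = Counter(zip(x[:-1], y[:-1]))
    pairs_xx = Counter(zip(x[1:], x[:-1]))
    singles = Counter(x[:-1])
    n = len(x) - 1
    te = 0.0
    for (x1, x0, y0), c in triples.items():
        p_joint = c / n
        p_cond_xy = c / pairs_xy[(x0, y0)]              # p(x1 | x0, y0)
        p_cond_x = pairs_xx[(x1, x0)] / singles[x0]     # p(x1 | x0)
        te += p_joint * np.log2(p_cond_xy / p_cond_x)
    return te   # bits per time step

rng = np.random.default_rng(1)
y = rng.integers(0, 2, 1000)
x = np.roll(y, 1)   # x copies y with one step of lag, so T_{Y->X} is ~1 bit
print(transfer_entropy(x, y))
```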

The constraints on mobility identified by the information dynamics were investigated and partially overcome with bio-inspired collective behaviour [36]. Gliders2015 utilised several elements of swarm behaviour, attempting to keep each player’s position as close as possible to that suggested by a specific tactical scheme, while incorporating slight variations in order to maximise the chances of receiving a pass and/or shooting at the opponent’s goal. This behaviour increased the degree of coherent mobility: on the one hand, the players constantly refine their positions in response to opponent players; on the other hand, the repositioning is not erratic, and the players move in coordinated ways.
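A minimal sketch of such constrained repositioning is given below; the weights and the inverse-square repulsion term are illustrative assumptions rather than the actual Gliders2015 rules.

```python
import numpy as np

# Sketch of swarm-like repositioning: each player stays anchored to the
# formation point while drifting away from nearby opponents, with the
# adjustment capped so that team movement remains coordinated.

def refine_position(formation_pt, opponents, max_shift=3.0, step=0.5):
    pos = np.asarray(formation_pt, dtype=float)
    repulsion = sum((pos - o) / (np.linalg.norm(pos - o) ** 2 + 1e-6)
                    for o in np.asarray(opponents, dtype=float))
    shift = step * repulsion                      # drift toward open space
    if np.linalg.norm(shift) > max_shift:         # never far from the scheme
        shift *= max_shift / np.linalg.norm(shift)
    return pos + shift

print(refine_position([0.0, 0.0], [[2.0, 1.0], [-3.0, 4.0]]))
```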

These directions were unified within a single development and evaluation framework which made it possible to explore the search-space in two ways: by translating human expertise into new behaviours and tactics, and by exhaustively recombining them with artificial evolution, leveraging the power of modern supercomputing. This fusion, we believe, produced a disruptive innovation on its own, providing the winning edge for Gliders.

9 Conclusion

In this paper we reviewed the disruptive innovations which affected the advancement of the RoboCup 2D Soccer Simulation League over the twenty years since its inception, and placed the progress of our champion team in this context. It is important to realise that neither of these processes has been linear, and many ideas have developed along a spiral-shaped trajectory, resurfacing over the years in different implementations. For example, the utility of evolutionary computation supported by supercomputing was suggested as early as 1997, when a simulated team was developed whose agents’ high-level decision-making behaviours had been entirely evolved using genetic programming [48]. Yet the complexity of the domain proved too challenging for this approach to gain widespread adoption at that time.

Without exception, all the winning approaches combined elements of automation (debugging, machine learning, planning, opponent modelling, evolutionary computation) with human-based innovation in terms of a decomposition of the search-space, providing various configurations, templates and structures. Is there still a way toward a fully automated solution, in which the agents learn or evolve to play a competitive game without detailed guidance from human designers, but rather by trying to satisfy a universal objective (“win a game”)?

On the one hand, the ability to run a massive number of simulated games on supercomputing clusters, producing replicable results, will only strengthen over time [4], and so lends some hope that this challenge can be met. On the other hand, the enormous size and dimensionality of the search-space would defy any unstructured exploration strategy. A methodology successfully resolving this dilemma may not only provide an ultimate disruptive innovation in the League, but also a major breakthrough in general AI research.