Toward a Methodology for Agent-Based Data Mining and Visualization

  • Elizabeth Sklar
  • Chipp Jansen
  • Jonathan Chan
  • Michael Byrd
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7103)


We explore the notion of agent-based data mining and visualization as a means for exploring large, multi-dimensional data sets. In Reynolds’ classic flocking algorithm (1987), individuals move in a 2-dimensional space and emulate the behavior of a flock of birds (or “boids”, as Reynolds refers to them). Each individual in the simulated flock exhibits specific behaviors that dictate how it moves and how it interacts with other boids in its “neighborhood”. We are interested in using this approach as a way of visualizing large multi-dimensional data sets. In particular, we are focused on data sets in which records contain time-tagged information about people (e.g., a student in an educational data set or a patient in a medical records data set). We present a system in which individuals in the data set are represented as agents, or “data boids”. The flocking exhibited by our boids is driven not by observation and emulation of creatures in nature, but rather by features inherent in the data set. The visualization quickly shows separation of data boids into clusters, where members are attracted to each other by common feature values.


Geographic Information System Cellular Automaton Categorical Feature Information Visualization Steering Vector 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Aupetit, S., Monmarché, N., Slimane, M., Guinot, C., Venturini, G.: Clustering and Dynamic Data Visualization with Artificial Flying Insect. In: Cantú-Paz, E., Foster, J.A., Deb, K., Davis, L., Roy, R., O’Reilly, U.-M., Beyer, H.-G., Kendall, G., Wilson, S.W., Harman, M., Wegener, J., Dasgupta, D., Potter, M.A., Schultz, A., Dowsland, K.A., Jonoska, N., Miller, J., Standish, R.K. (eds.) GECCO 2003, Part I. LNCS, vol. 2723, pp. 140–141. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  2. 2.
    Butler, D.: Virtual globes: The web-wide world. Nature 439, 776–778 (2006)CrossRefGoogle Scholar
  3. 3.
    Cao, L., Gorodetsky, V., Mitkas, P.A.: Agent mining: The synergy of agents and data mining. IEEE Intelligent Systems 24(3), 64–72 (2009)CrossRefGoogle Scholar
  4. 4.
    Deneubourg, J.L., Goss, S., Franks, N., Sendova-Franks, A., Detrain, C., Chrétian, L.: The dynamics of collective sorting: Robot-like ants and ant-like robots. In: From Animals to Animats: 1st International Conference on Simulation of Adaptative Behaviour, pp. 356–363 (1990)Google Scholar
  5. 5.
    Dorigo, M., Maniezzo, V., Colorni, A.: The Ant System: Optimization by a colony of cooperating agents. IEEE Transactions on Systems, Man and Cybernetics-Part B 26(1), 1–13 (1996)CrossRefGoogle Scholar
  6. 6.
    Fisher, D.H.: Knowledge Acquisition Via Incremental Conceptual Clustering. Machine Learning 2, 139–172 (1987)Google Scholar
  7. 7.
    Google: Google earth (2005),
  8. 8.
    Handl, J., Meyer, B.: Ant-based and swarm-based clustering. Swarm Intelligence 1(2), 95–113 (2007)CrossRefGoogle Scholar
  9. 9.
    Lisle, R.J.: Google earth: a new geological resource. Geology Today (2006)Google Scholar
  10. 10.
    MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)Google Scholar
  11. 11.
    Moere, A.V.: Time-varying data visualization using information flocking boids. In: Proceedings of IEEE Symposium on Information Visualization, pp. 10–12 (2004)Google Scholar
  12. 12.
    Moere, A.V.: A model for self-organizing data visualization using decentralized multiagent systems. In: Prokopenko, M. (ed.) Advances in Applied Self-organizing Systems, Advanced Information and Knowledge Processing, Part III, pp. 291–324. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  13. 13.
    Picarougne, F., Azzag, H., Venturini, G., Guinot, C.: On data clustering with a flock of artificial agents. In: Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pp. 777–778 (2004)Google Scholar
  14. 14.
    Picarougne, F., Azzag, H., Venturini, G., Guinot, C.: A new approach of data clustering using a flock of agents. Evolutionary Computation 15(3), 345–367 (2007)CrossRefGoogle Scholar
  15. 15.
    Processing (2010),
  16. 16.
    Proctor, G., Winter, C.: Information Flocking: Data Visualisation in Virtual Worlds Using Emergent Behaviours. In: Heudin, J.-C. (ed.) VW 1998. LNCS (LNAI), vol. 1434, pp. 168–176. Springer, Heidelberg (1998)CrossRefGoogle Scholar
  17. 17.
    Reynolds, C.W.: Flocks, Herds and Schools: A Distributed Behavioral Model. In: International Conference on Computer Graphics and Interactive Systems, pp. 25–34 (1987)Google Scholar
  18. 18.
    Shannon, C.E.: A mathematical theory of communication. The Bell System Technical Journal 27, 379–423 (1948)MathSciNetCrossRefzbMATHGoogle Scholar
  19. 19.
  20. 20.
    Wolfram, S.: Cellular automata as models of complexity. Nature 311, 419–424 (1984)CrossRefGoogle Scholar
  21. 21.
    Xiaohui Cui, J.G., Potok, T.E.: A flocking based algorithm for document clustering analysis. Journal of Systems Architecture 52(8-9), 505–515 (2006)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Elizabeth Sklar
    • 1
    • 2
  • Chipp Jansen
    • 1
    • 3
  • Jonathan Chan
    • 1
  • Michael Byrd
    • 2
  1. 1.Brooklyn CollegeThe City University of New YorkUSA
  2. 2.The Graduate CenterThe City University of New YorkUSA
  3. 3.Hunter CollegeThe City University of New YorkUSA

Personalised recommendations