Data Explosion, Data Nature and Dataology

  • Yangyong Zhu
  • Ning Zhong
  • Yun Xiong
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5819)


The essence of computer applications is to store things in the real world into computer systems in the form of data, i.e., it is a process of producing data. Some data are the records related to culture and society, and others are the descriptions of phenomena of universe and life. The large scale of data is rapidly generated and stored in computer systems, which is called data explosion. Data explosion forms data nature in computer systems. To explore data nature, new theories and methods are required. In this paper, we present the concept of data nature and introduce the problems arising from data nature, and then we define a new discipline named dataology (also called data science or science of data), which is an umbrella of theories, methods and technologies for studying data nature. The research issues and framework of dataology are proposed.


Cloud Computing Real Nature Data Nature Brain Data Internet Data 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
  2. 2.
    Krawetz, S., Misener, S. (eds.): Bioinformatics Methods and Protocols. Humana Press (2000)Google Scholar
  3. 3.
    Zhong, N., Liu, J., Yao, Y.Y., Wu, J., Lu, S., Li, K. (eds.): Web Intelligence Meets Brain Informatics. LNCS (LNAI), vol. 4845, pp. 1–31. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  4. 4.
    Cao, L.B.: Behavior Informatics and Analytics: Let Behavior Talk. In: Proceedings of the 2008 IEEE International Conference on Data Mining Workshops (2008)Google Scholar
  5. 5.
    Berman, F., Fox, G., Hey, A. (eds.): Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons, Chichester (2003)Google Scholar
  6. 6.
    Hayes, B.: Cloud Computing. Communications of the ACM 51(7), 9–11 (2008)CrossRefGoogle Scholar
  7. 7.
    Zhong, N., Liu, J., Yao, Y.Y.: Envisioning Intelligent Information Technologies through the Prism of Web Intelligence. Communications of the ACM 50(3), 89–94 (2007)CrossRefGoogle Scholar
  8. 8.
    Chapman, G., Marc, R.: The National Information Infrastructure: A Public Interest Opportunity. Computer Professionals for Social Responsibility 11(2), 13–15 (1993)Google Scholar
  9. 9.
    Collins, F.S., Patrinos, A., Jordan, E., et al.: New Goals for the U.S. Human Genome Project: 1998-2003. Science 282(5389), 682–689 (1998)CrossRefGoogle Scholar
  10. 10.
    Freeston, M.: The Alexandria Digital Library and the Alexandria Digital Earth Prototype. In: Proceedings of the 4th ACM/IEEE-CS joint Conference on Digital Libraries (2004)Google Scholar
  11. 11.
    Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: an overview. In: Advances in Knowledge Discovery and Data Mining (1996)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Yangyong Zhu
    • 1
  • Ning Zhong
    • 2
  • Yun Xiong
    • 1
  1. 1.School of Computer ScienceFudan UniversityShanghaiP.R. China
  2. 2.Dept. of Life Science and InformaticsMaebashi Institute of TechnologyMaebashi-CityJapan

Personalised recommendations