Skip to main content
Log in

Not by metadata alone: the use of diverse forms of knowledge to locate data for reuse

  • REGULAR PAPER
  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

An important set of challenges for eScience initiatives and digital libraries concern the need to provide scientists with the ability to access data from multiple sources. This paper argues that an analysis of scientists‘ reuse of data prior to the advent of eScience can illuminate the requirements and design of digital libraries and cyberinfrastructure. As part of a larger study on data sharing and reuse, I investigated the processes by which ecologists locate data that were initially collected by others. Ecological data are unusually complex and present daunting problems of interpretation and analysis that must be considered in the design of cyberinfrastructure. The ecologists that I interviewed found ways to overcome many of these difficulties. One part of my results shows that ecologists use formal and informal knowledge that they have gained through disciplinary training and through their own data-gathering experiences to help them overcome hurdles related to finding, acquiring, and validating data collected by others. A second part of my findings reveals that ecologists rely on formal notions of scientific practice that emphasize objectivity to justify the methods they use to collect data for reuse. I discuss the implications of these findings for digital libraries and eScience initiatives.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Andelman S., Bowles C., Willig M. and Waide R. (2004). Understanding environmental complexity through a distributed knowledge network. BioScience 54: 240–246

    Article  Google Scholar 

  2. Atkins, D., Droegemeier, K., Feldman, S., Garcia-Molina, H., Messerschmitt, D., Messina, P., Ostriker, J., Wright, H.: Revolutionizing Science and Engineering through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Panel on Cyberinfrastructure. National Science Foundation, Washington, DC (2003)

  3. Baskin Y. (1997). Center seeks synthesis to make ecology more useful. Science 275(5298): 310–311

    Article  Google Scholar 

  4. Berman, F., Brady, H.: Final Report: NSF SBE-CISE Workshop on Cyberinfrastructure and the Social Sciences (2005) http://vis.sdsc.edu/sbe/reports/SBE-CISE-FINAL.pdf [last visited November 2006]

  5. Birnholtz, J.P., Bietz, M.J.: Data at work: supporting sharing in science and engineering. In: Proceedings of the 2003 International ACM SIGGROUP Conference on Supporting Group Work. pp. 339–348 ACM Press, New York (2003)

  6. Bowker G.C. (2000). Biodiversity datadiversity. Soc. Stud. Sci. 30(5): 643–683

    Article  Google Scholar 

  7. Bowser, C.J.: Historic data sets: lessons from the past, lessons for the future. In: Michener, W.K. (ed.) Research Data Management in the Ecological Sciences. pp. 155–179. University of South Carolina Press, Columbia (1986)

  8. Brown C. (2003). The changing face of scientific discourse: analysis of genomic and proteomic database usage and acceptance. J. Am. Soc. Inform. Sci. Technol. 54(10): 926–938

    Article  Google Scholar 

  9. Buetow K.H. (2005). Cyberinfrastructure: empowering a “Third Way” in biomedical research. Science 308(5723): 821–824

    Article  Google Scholar 

  10. Chen, C., Hernon, P. (eds.): Numeric databases. Ablex, Norwood, NJ (1984)

  11. Clarke, A.E., Fujimura, J.H.: What tools? Which jobs? Why right? In: Clarke, A.E., Fujimura, J.H. (eds) The Right Tools for the Job: At Work in Twentieth-Century Life Sciences. pp. 3–44. Princeton University Press, Princeton (1992)

  12. Collins, H.M.: [1985] Changing Order: Replication and Induction in Scientific Practice. University of Chicago Press, Chicago (1992)

  13. Committee on the Future of Long-Term Ecological Data: Final Report of the Ecological Society of America Committee on the Future of Long-Term Ecological Data (FLED) (Vols. 1–2). Ecological Society of America, Washington, DC (1995)

  14. Cooper, H., Hedges, L.V.: The Handbook of Research Synthesis. Russell Sage Foundation, New York (1994)

  15. Crow, R.: The Case for Institutional Repositories: A SPARC Position Paper. The Scholarly Publishing & Academic Resources Coalition, Washington, DC (2002)

  16. Ecological Visions Committee: Ecological Science and Sustainability for a Crowded Planet: 21st Century Vision and Action Plan for the Ecological Society of America. Ecological Society of America, Washington, DC (2004)

  17. Ellisman M.H. (2005). Cyberinfrastructure and the future of collaborative work. Issues Sci. Technol. 22(1): 43–50

    Google Scholar 

  18. Emmott, S.: Towards 2020 Science. Microsoft Research, Redmond, WA (2006)

  19. Glasner P. (2002). Beyond the genome: reconstituting the new genetics. New Genet. Soc. 21(3): 267–277

    Article  Google Scholar 

  20. Gray, A.S., Dodd, S.A.: The roles of libraries and information centers in providing access to numeric databases. In: Chen, C., Hernon, P. (eds.) Numeric Databases. pp. 247–262. Ablex, Norwood (1984)

  21. Hey T. and Trefethen A.E. (2005). Cyberinfrastructure for e-Science. Science 308(5723): 817–821

    Article  Google Scholar 

  22. Hilgartner S. (1995). Biomolecular databases: new communication regimes for biology?. Sci. Commun. 17(2): 240–263

    Article  Google Scholar 

  23. Hine C. (2006). Databases as scientific instruments and their role in the ordering of scientific work. Soc. Stud. Sci. 36(2): 269–298

    Article  Google Scholar 

  24. Kuklick H. and Kohler R.E. (1996). Science in the field: introduction. OSIRIS, Second series 11: 1–16

    Google Scholar 

  25. Lynch, C.: Research libraries engage the digital world: A US-UK comparative examination of recent history and future prospects. Ariadne 46 http://www.ariadne.ac.uk/issue46/lynch/ [last visited November 2006] (2006)

  26. Michener, W.K., Brunt, J.W. (eds.): Ecological Data: Design, Management, and Processing. Blackwell Science, Oxford, UK (2000)

  27. Michener W.K., Brunt J.W., Helly J.J., Kirchner T.B. and Stafford S.G. (1997). Nongeospatial metadata for the ecological sciences. Ecol. Appl. 7(1): 330–342

    Article  Google Scholar 

  28. National Research Council: Finding the Forest in the Trees: The Challenge of Combining Diverse Environmental Data: Selected Case Studies. National Academy Press, Washington, DC (1995)

  29. National Research Council: NEON: Addressing the Nation’s Environmental Challenges. National Academy Press, Washington, DC (2003)

  30. National Science Board: Long-Lived Data Collections: Enabling Research and Education in the 21st Century. National Science Foundation, Arlington, VA (2005)

  31. Pouchard L., Woolf A. and Bernholdt D. (2005). Data grid discovery and semantic web technologies for the earth sciences. Int. J. Dig. Lib. 5: 72–83

    Article  Google Scholar 

  32. Roth W.-M. and Bowen G.M. (1999). Digitizing lizards: the topology of ‘vision’ in ecological fieldwork. Soc. Stud. Sci. 29(5): 719–764

    Article  Google Scholar 

  33. Roth W.-M. and Bowen G.M. (2001). ‘Creative solutions’ and ‘fibbing results’: enculturation in field ecology. Soc. Stud. Sci. 31(4): 533–556

    Article  Google Scholar 

  34. Roth W.-M. and Bowen G.M. (2001). Of disciplined minds and disciplined bodies: on becoming an ecologist. Qual. Sociol. 24(4): 459–481

    Article  Google Scholar 

  35. Schiff, L.R., Van House, N.A., Butler, M.H.: Understanding complex information environments: a social analysis of watershed planning. In: Allen, R.B., Rasmussen, E.M. (eds.) Proceedings of the Second ACM International Conference on Digital Libraries. pp. 161-168. ACM Press, New York (1997)

  36. Smith H.J. and Keil M. (2003). The reluctance to report bad news on troubled software projects: a theoretical model. Inform. Syst. J. 13: 69–95

    Article  Google Scholar 

  37. Smith J.T. (1996). Meta-analysis: the librarian as a member of an interdisciplinary research team. Library Trends 45(2): 265–279

    Google Scholar 

  38. Star S.L. and Ruhleder K. (1996). The ecology of infrastructure: problems in the implementation of large-scale information systems. Inform. Syst. Res. 7: 111–134

    Article  Google Scholar 

  39. Weeber M., Kors J.A. and Mons B. (2005). Online tools to support literature-based discovery in the life sciences. Briefings Biomed. Inform. 6(3): 277–286

    Google Scholar 

  40. Wouters, P., Reddy, C.: Big science data policies. In: Wouters, P., Schröder, P. (eds.) Promise and Practice in Data Sharing, NIWI-KNAW, Amsterdam, pp. 13–40 (2003)

  41. Zimmerman, A.: Data Sharing and Secondary Use of Scientific Data: Experiences of Ecologists. Unpublished dissertation, University of Michigan, Ann Arbor (2003)

  42. Zimmerman, A.: New knowledge from old data: the role of standards in the sharing and reuse of ecological data. Sci Technol Human Values (2008, in press)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ann Zimmerman.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zimmerman, A. Not by metadata alone: the use of diverse forms of knowledge to locate data for reuse. Int J Digit Libr 7, 5–16 (2007). https://doi.org/10.1007/s00799-007-0015-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-007-0015-8

Keywords

Navigation