Enriching the Notion of Data Curation in E-Science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network

Abstract

This paper aims to enrich the current understanding of data curation prevalent in e-Science by drawing on an ethnographic study of one of the longest-running efforts at long-term consistent data collection with open data sharing in an environment of interdisciplinary collaboration. In this context, we identify a set of salient characteristics of ecological research and data that shape the data stewardship approach of the Long Term Ecological Research (LTER) network. We describe the actual practices through which LTER information managers attend to the extended temporal scale of long-term research and data sets, both through data care work and through information infrastructure development. We discuss the issues of the long term and of continuity, which represent central challenges for data curation and stewardship. We argue for more efforts to be directed to understanding what is at stake with a long-term perspective and differing temporal scales, as well as to studying actual practices of data curation and stewardship, in order to provide more coherent understandings of e-Science solutions and technologies.

Acknowledgements

This work is partially supported by an NSF/SBE/SES Human Social Dynamics grant #04-33369. The work is conducted in collaboration with the LTER community (NSF/OCE #04-17616, NSF/OPP #02-17282 and #04-05069). The fieldwork was conducted in 2002, and we offer our special thanks to Geoffrey C. Bowker for collaboration in the BDEI project (NSF/DGO #EIA-01-31958). Furthermore, we thank the anonymous reviewers for their constructive comments.

Author information

Correspondence to Helena Karasti.

About this article

Cite this article

Karasti, H., Baker, K.S. & Halkola, E. Enriching the Notion of Data Curation in E-Science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network. Comput Supported Coop Work 15, 321–358 (2006). https://doi.org/10.1007/s10606-006-9023-2

Keywords

  • cyberinfrastructure
  • data stewardship
  • information management
  • ecology
  • long-term perspective
  • scientific collaboration