Abstract
This paper aims to enrich the current understanding of data curation prevalent in e-Science by drawing on an ethnographic study of one of the longest-running efforts at long-term, consistent data collection with open data sharing in an environment of interdisciplinary collaboration. In this context we identify a set of salient characteristics of ecological research and data that shape the data stewardship approach of the Long Term Ecological Research (LTER) network. We describe the actual practices through which LTER information managers attend to the extended temporal scale of long-term research and data sets, both through data care work and through information infrastructure development. We discuss the issues of the long term and of continuity, which represent central challenges for data curation and stewardship. We argue that more effort should be directed to understanding what is at stake with a long-term perspective and differing temporal scales, as well as to studying actual practices of data curation and stewardship, in order to provide more coherent understandings of e-Science solutions and technologies.
Acknowledgements
This work is partially supported by an NSF/SBE/SES Human Social Dynamics grant #04-33369. The work is conducted in collaboration with the LTER community (NSF/OCE #04-17616, NSF/OPP #02-17282 and #04-05069). The fieldwork was conducted in 2002, and we offer our special thanks to Geoffrey C. Bowker for collaboration in the BDEI project (NSF/DGO #EIA-01-31958). Furthermore, we thank the anonymous reviewers for their constructive comments.
Cite this article
Karasti, H., Baker, K.S. & Halkola, E. Enriching the Notion of Data Curation in E-Science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network. Comput Supported Coop Work 15, 321–358 (2006). https://doi.org/10.1007/s10606-006-9023-2