Optimizing Access Policies for Big Data Repositories: Latency Variables and the Genome Commons

Part of the Studies in Big Data book series (SBD, volume 18)


The design of access policies for large aggregations of scientific data has become increasingly important in today’s data-rich research environment. Planners routinely weigh different policy variables when deciding how and when to release data to the public. This chapter proposes a methodology in which the timing of data release is used to balance these policy variables and thereby optimize data release policies. The global aggregation of publicly available genomic data, or the “genome commons,” is used to illustrate this methodology.


Keywords: Commons, Genome, Data sharing, Latency



Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. University of Utah, S.J. Quinney College of Law, Salt Lake City, USA
