Abstract
Dynamic documents that combine text and code, which is evaluated to dynamically create content when the document is “rendered,” for example, Sweave, are a large step forward in reproducible data analysis and computation. However, to capture the research process, we need richer paradigms and infrastructure. The process includes all the investigations and computations, and not just the final reported ones, and the entirety represents reproducible research. In addition to richer paradigms for reproducability, we want to be able to capture more complex aspects of the computational process, such as the use of multiple languages, and also engage different communities using other programming languages so that reproducible computations and research become more widespread. We also need to integrate existing and future approaches with commonly used tools such as Microsoft Word and make the resulting documents richer for authors and readers. We present two approaches to structured, dynamic documents that use modern, ubiquitous standard technologies (XML) and provide extensible infrastructure for richer documents. The first integrates R and Microsoft Word for use by a broader audience and provides some innovations in this interface, and the second uses eXtensible Stylesheet Language (XSL) and R to provide a flexible and extensible infrastructure for richer, more accessible dynamic documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Leisch F (2002) Sweave: dynamic generation of statistical reports using literate data analysis. In: Wolfgang Hardle BR (ed) Compstat 2002 proceedings in computational statistics. Physica Verlag, Heidelberg, pp 575–580
Max Kuhn SW (2008) odfWeave: sweave processing of open document format (odf) files
Mittelbach F, Goossens M et al (2006). The LaTeX companion, 2nd edn. Addison Wesley, Boston, MA
Pawson D (2002) XSL-FO: making XML look good in print. O’Reilly, Sebastopol, CA
Peng RD (2008) Caching and distributing statistical analyses in R. J Stat Software 26(7):1–24
Ramsey N (1994) Literate programming simplified. IEEE Software 11(5):97–105
R Development Core Team (2008) A language and environment for statistical computing, Vienna, Austria
Vugt WV (2007) Open XML: the markup explained. Retrieved 2009, from http://openxmldeveloper.org: http://openxmldeveloper.org/attachment/1970.ashx
Walsh N, Muellner L (1999) DocBook: The definitive guide. O’Reilly, Sebastopol, CA
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Nolan, D., Peng, R.D., Lang, D.T. (2010). Enhanced Dynamic Documents for Reproducible Research. In: Ochs, M., Casagrande, J., Davuluri, R. (eds) Biomedical Informatics for Cancer Research. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-5714-6_20
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5714-6_20
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5712-2
Online ISBN: 978-1-4419-5714-6
eBook Packages: MedicineMedicine (R0)