Abstract
Considerations are presented around the design of a materials data infrastructure including import of structured and unstructured data, storage of that data for archival and retrieval, and access to that data through programmatic and graphical interfaces. In particular, the choices around technologies used in such an infrastructure, the benefits and drawbacks of those technologies, and their impact on the experience of users of that system are presented. The Citrination platform is used as an example of a materials data infrastructure and the choices made around architecture are discussed.
This is a preview of subscription content, access via your institution.

References
J.H. Westbrook and J. R. Rumble Jr., Computerized Materials Data Systems (Office of Standard Reference Data, National Bureau of Standards, 1983).
Materials Genome Initiative National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative (National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative, Washington, 2014).
J.P. Holdren, “Increasing access to the results of federally funded scientific research. https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf. Accessed 26 May 2016.
T. Austin, Mater. Discov. (2016). doi:10.1016/j.md.2015.12.003.
The NoMaD Repository, http://nomad-repository.eu. Accessed 26 May 2016.
Y. Xu, M. Yamazaki, and P. Villars, Jpn. J. Appl. Phys. 50, 11RH02 (2011).
A. Belsky, M. Hellenbrandt, V.L. Karen, and P. Luksch, Acta Cryst. B 58, 364 (2002).
A. Jain, S.P. Ong, G. Hautier, W. Chen, W.D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder, and K.A. Persson, Apl Mater. 1, 011002 (2013).
H.E. Pence and A. Williams, J. Chem. Edu. 87, 1123 (2010).
S.R. Hall, B. McMahon (Eds), International Tables for Crystallography Volume G: Definition and exchange of crystallographic data (Springer, The Netherlands, 2005).
Materials Commons, https://materialscommons.org. Accessed 26 May 2016.
NIST Repositories, https://materialsdata.nist.gov/dspace/xmlui. Accessed 26 May 2016.
I. Foster, R. Ananthakrishnan, B. Blaiszik, K. Chard, R. Osborn, S. Tuecke, M. Wilde, and J.M. Wozniak, Adv. Par. Com. 26, 117 (2015).
Dryad Digital Reposity, http://datadryad.org. Accessed 26 May 2016.
Figshare, https://figshare.com. Accessed 26 May 2016.
Citrination, https://citrination.com. Accessed 26 May 2016.
J.A. Warren, and R.F. Boisvert, Building the Materials Innovation Infrastructure: Data and Standards, NISTIR 7898.
C.H. Ward, and J.A. Warren, Materials Genome Initiative: Materials Data, NISTIR 8038.
NIST Materials Data Curation System, https://mgi.nist.gov/materials-data-curation-system. Accessed 26 May 2016.
P. Huck, A. Jain, D. Gunter, D. Winston, and K. Persson, A Community Contribution Framework for Sharing Materials Data with Materials Project, arXiv:1510.05024v1.
K. Michel and B. Meredig, Citrine Informatics. Redwood City, CA, unpublished research, 2016.
PIF Documentation, http://www.citrine.io/pif. Accessed 26 May 2016.
Pypif, http://www.citrine.io/pypif. Accessed 26 May 2016.
Jpif, http://www.citrine.io/jpif. Accessed 26 May 2016.
J. Shin, S. Wu, F. Wang, C. De Sa, C. Zhang, and C. Re, Proc. VLDB Endow. 8, 1310 (2015).
E.F. Codd, Commun. ACM 25, 109 (1982).
S. Sumathi and S. Esakkirajan, Fundamentals of Relational Database Management Systems (Springer, The Netherlands, 2007).
M. Mesnier, G.R. Ganger, and E. Riedel, IEEE Commun. Mag. 41, 84 (2003).
P.J. Sadalage and M. Fowler, NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence (Boston: Addison-Wesley Professional, 2012).
Lucene, https://lucene.apache.org. Accessed 26 May 2016.
Solr, http://lucene.apache.org/solr. Accessed 26 May 2016.
ElasticSearch, https://www.elastic.co/products/elasticsearch. Accessed 26 May 2016.
Citrination API Documentation, http://www.citrine.io/api. Accessed 26 May 2016.
Citrine Informatics, http://www.citrine.io. Accessed 26 May 2016.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
O’Mara, J., Meredig, B. & Michel, K. Materials Data Infrastructure: A Case Study of the Citrination Platform to Examine Data Import, Storage, and Access. JOM 68, 2031–2034 (2016). https://doi.org/10.1007/s11837-016-1984-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11837-016-1984-0
Keywords
- Relational Database
- Complex Query
- Unstructured Data
- Query Response Time
- Inorganic Crystal Structure Database