Skip to main content

Materials Data Infrastructure: A Case Study of the Citrination Platform to Examine Data Import, Storage, and Access


Considerations are presented around the design of a materials data infrastructure including import of structured and unstructured data, storage of that data for archival and retrieval, and access to that data through programmatic and graphical interfaces. In particular, the choices around technologies used in such an infrastructure, the benefits and drawbacks of those technologies, and their impact on the experience of users of that system are presented. The Citrination platform is used as an example of a materials data infrastructure and the choices made around architecture are discussed.

This is a preview of subscription content, access via your institution.

Fig. 1


  1. J.H. Westbrook and J. R. Rumble Jr., Computerized Materials Data Systems (Office of Standard Reference Data, National Bureau of Standards, 1983).

  2. Materials Genome Initiative National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative (National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative, Washington, 2014).

  3. J.P. Holdren, “Increasing access to the results of federally funded scientific research. Accessed 26 May 2016.

  4. T. Austin, Mater. Discov. (2016). doi:10.1016/

  5. The NoMaD Repository, Accessed 26 May 2016.

  6. Y. Xu, M. Yamazaki, and P. Villars, Jpn. J. Appl. Phys. 50, 11RH02 (2011).

    Article  Google Scholar 

  7. A. Belsky, M. Hellenbrandt, V.L. Karen, and P. Luksch, Acta Cryst. B 58, 364 (2002).

    Article  Google Scholar 

  8. A. Jain, S.P. Ong, G. Hautier, W. Chen, W.D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder, and K.A. Persson, Apl Mater. 1, 011002 (2013).

    Article  Google Scholar 

  9. H.E. Pence and A. Williams, J. Chem. Edu. 87, 1123 (2010).

    Article  Google Scholar 

  10. S.R. Hall, B. McMahon (Eds), International Tables for Crystallography Volume G: Definition and exchange of crystallographic data (Springer, The Netherlands, 2005).

  11. Materials Commons, Accessed 26 May 2016.

  12. NIST Repositories, Accessed 26 May 2016.

  13. I. Foster, R. Ananthakrishnan, B. Blaiszik, K. Chard, R. Osborn, S. Tuecke, M. Wilde, and J.M. Wozniak, Adv. Par. Com. 26, 117 (2015).

    Google Scholar 

  14. Dryad Digital Reposity, Accessed 26 May 2016.

  15. Figshare, Accessed 26 May 2016.

  16. Citrination, Accessed 26 May 2016.

  17. J.A. Warren, and R.F. Boisvert, Building the Materials Innovation Infrastructure: Data and Standards, NISTIR 7898.

  18. C.H. Ward, and J.A. Warren, Materials Genome Initiative: Materials Data, NISTIR 8038.

  19. NIST Materials Data Curation System, Accessed 26 May 2016.

  20. P. Huck, A. Jain, D. Gunter, D. Winston, and K. Persson, A Community Contribution Framework for Sharing Materials Data with Materials Project, arXiv:1510.05024v1.

  21. K. Michel and B. Meredig, Citrine Informatics. Redwood City, CA, unpublished research, 2016.

  22. PIF Documentation, Accessed 26 May 2016.

  23. Pypif, Accessed 26 May 2016.

  24. Jpif, Accessed 26 May 2016.

  25. J. Shin, S. Wu, F. Wang, C. De Sa, C. Zhang, and C. Re, Proc. VLDB Endow. 8, 1310 (2015).

    Article  Google Scholar 

  26. E.F. Codd, Commun. ACM 25, 109 (1982).

    Article  Google Scholar 

  27. S. Sumathi and S. Esakkirajan, Fundamentals of Relational Database Management Systems (Springer, The Netherlands, 2007).

  28. M. Mesnier, G.R. Ganger, and E. Riedel, IEEE Commun. Mag. 41, 84 (2003).

    Article  Google Scholar 

  29. P.J. Sadalage and M. Fowler, NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence (Boston: Addison-Wesley Professional, 2012).

    Google Scholar 

  30. Lucene, Accessed 26 May 2016.

  31. Solr, Accessed 26 May 2016.

  32. ElasticSearch, Accessed 26 May 2016.

  33. Citrination API Documentation, Accessed 26 May 2016.

  34. Citrine Informatics, Accessed 26 May 2016.

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Kyle Michel.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

O’Mara, J., Meredig, B. & Michel, K. Materials Data Infrastructure: A Case Study of the Citrination Platform to Examine Data Import, Storage, and Access. JOM 68, 2031–2034 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:


  • Relational Database
  • Complex Query
  • Unstructured Data
  • Query Response Time
  • Inorganic Crystal Structure Database