, Volume 68, Issue 8, pp 2031–2034 | Cite as

Materials Data Infrastructure: A Case Study of the Citrination Platform to Examine Data Import, Storage, and Access

  • Jordan O’Mara
  • Bryce Meredig
  • Kyle MichelEmail author


Considerations are presented around the design of a materials data infrastructure including import of structured and unstructured data, storage of that data for archival and retrieval, and access to that data through programmatic and graphical interfaces. In particular, the choices around technologies used in such an infrastructure, the benefits and drawbacks of those technologies, and their impact on the experience of users of that system are presented. The Citrination platform is used as an example of a materials data infrastructure and the choices made around architecture are discussed.


Relational Database Complex Query Unstructured Data Query Response Time Inorganic Crystal Structure Database 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    J.H. Westbrook and J. R. Rumble Jr., Computerized Materials Data Systems (Office of Standard Reference Data, National Bureau of Standards, 1983).Google Scholar
  2. 2.
    Materials Genome Initiative National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative (National Science and Technology Council Committee on Technology Subcommittee on the Materials Genome Initiative, Washington, 2014).Google Scholar
  3. 3.
    J.P. Holdren, “Increasing access to the results of federally funded scientific research. Accessed 26 May 2016.
  4. 4.
    T. Austin, Mater. Discov. (2016). doi: 10.1016/
  5. 5.
    The NoMaD Repository, Accessed 26 May 2016.
  6. 6.
    Y. Xu, M. Yamazaki, and P. Villars, Jpn. J. Appl. Phys. 50, 11RH02 (2011).CrossRefGoogle Scholar
  7. 7.
    A. Belsky, M. Hellenbrandt, V.L. Karen, and P. Luksch, Acta Cryst. B 58, 364 (2002).CrossRefGoogle Scholar
  8. 8.
    A. Jain, S.P. Ong, G. Hautier, W. Chen, W.D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder, and K.A. Persson, Apl Mater. 1, 011002 (2013).CrossRefGoogle Scholar
  9. 9.
    H.E. Pence and A. Williams, J. Chem. Edu. 87, 1123 (2010).CrossRefGoogle Scholar
  10. 10.
    S.R. Hall, B. McMahon (Eds), International Tables for Crystallography Volume G: Definition and exchange of crystallographic data (Springer, The Netherlands, 2005).Google Scholar
  11. 11.
    Materials Commons, Accessed 26 May 2016.
  12. 12.
    NIST Repositories, Accessed 26 May 2016.
  13. 13.
    I. Foster, R. Ananthakrishnan, B. Blaiszik, K. Chard, R. Osborn, S. Tuecke, M. Wilde, and J.M. Wozniak, Adv. Par. Com. 26, 117 (2015).Google Scholar
  14. 14.
    Dryad Digital Reposity, Accessed 26 May 2016.
  15. 15.
    Figshare, Accessed 26 May 2016.
  16. 16.
    Citrination, Accessed 26 May 2016.
  17. 17.
    J.A. Warren, and R.F. Boisvert, Building the Materials Innovation Infrastructure: Data and Standards, NISTIR 7898.Google Scholar
  18. 18.
    C.H. Ward, and J.A. Warren, Materials Genome Initiative: Materials Data, NISTIR 8038.Google Scholar
  19. 19.
    NIST Materials Data Curation System, Accessed 26 May 2016.
  20. 20.
    P. Huck, A. Jain, D. Gunter, D. Winston, and K. Persson, A Community Contribution Framework for Sharing Materials Data with Materials Project, arXiv:1510.05024v1.
  21. 21.
    K. Michel and B. Meredig, Citrine Informatics. Redwood City, CA, unpublished research, 2016.Google Scholar
  22. 22.
    PIF Documentation, Accessed 26 May 2016.
  23. 23.
    Pypif, Accessed 26 May 2016.
  24. 24.
    Jpif, Accessed 26 May 2016.
  25. 25.
    J. Shin, S. Wu, F. Wang, C. De Sa, C. Zhang, and C. Re, Proc. VLDB Endow. 8, 1310 (2015).CrossRefGoogle Scholar
  26. 26.
    E.F. Codd, Commun. ACM 25, 109 (1982).CrossRefGoogle Scholar
  27. 27.
    S. Sumathi and S. Esakkirajan, Fundamentals of Relational Database Management Systems (Springer, The Netherlands, 2007).Google Scholar
  28. 28.
    M. Mesnier, G.R. Ganger, and E. Riedel, IEEE Commun. Mag. 41, 84 (2003).CrossRefGoogle Scholar
  29. 29.
    P.J. Sadalage and M. Fowler, NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence (Boston: Addison-Wesley Professional, 2012).Google Scholar
  30. 30.
    Lucene, Accessed 26 May 2016.
  31. 31.
    Solr, Accessed 26 May 2016.
  32. 32.
    ElasticSearch, Accessed 26 May 2016.
  33. 33.
    Citrination API Documentation, Accessed 26 May 2016.
  34. 34.
    Citrine Informatics, Accessed 26 May 2016.

Copyright information

© The Minerals, Metals & Materials Society 2016

Authors and Affiliations

  1. 1.Citrine Informatics, Inc.Redwood CityUSA

Personalised recommendations