top of page

Community Accessible Datastore of High-Throughput Calculations: Experiences from the Materials Project

Dan Gunter, Shreyas Cholia, Anubhav Jain, Michael Kocher, Kristin Persson, Lavanya Ramakrishnan, Shyue Ping Ong, Gerbrand Ceder

Efforts such as the Human Genome Project provided a dramatic example of opening scientific datasets to the community. Making high quality scientific data accessible through an online database allows scientists around the world to multiply the value of that data through scientific innovations. Similarly, the goal of the Materials Project is to calculate physical properties of all known inorganic materials and make this data freely available, with the goal of accelerating to invention of better materials. However, the complexity of scientific data, and the complexity of the simulations needed to generate and analyze it, pose challenges to current software ecosystem. In this paper, we describe the approach we used in the Materials Project to overcome these challenges and create and disseminate a high quality database of materials properties computed by solving the basic laws of physics. Our infrastructure requires a novel combination of highthroughput approaches with broadly applicable and scalable approaches to data storage and dissemination.

National University of Singapore
College of Design and Engineering
Department of Materials Science and Engineering
9 Engineering Drive 1, Blk EA, #03-09
Singapore 117575
Singapore 

  • Linkedin
  • Youtube
  • GitHub

 

© 2026 by Materialyze.AI.

bottom of page