A Machine Learning Aided Hierarchical Screening Strategy for Materials Discovery

By Anjana Talapatra

Los Alamos National Laboratory, Los Alamos, NM

Published on

Abstract

Run the Tool: Machine Learning for Materials Science: Part 1 One of the most basic approaches to problem solving is to conceptualize the problem at different abstraction levels and translate from one abstraction level to the others easily, i.e., deal with them hierarchically. This concept is especially applicable to the field of novel materials discovery, wherein large candidate domains can be quickly and efficiently explored by hierarchically discarding irrelevant candidates. In this tutorial, we illustrate this approach using the example of wide band gap oxide perovskites. We will sequentially search a very large domain space of single and double oxide perovskites to identify candidates that are likely to be formable, thermodynamically stable, exhibit insulator nature and have a wide band gap. To this end, we will build four machine learning (ML) models: three classification and one regression model using experimental and DFT-calculated training data. The tutorial will discuss best practices for building ML models, commonly encountered pitfalls and how best to avoid them.

The corresponding tool for this tutorial can be run on nanoHUB at ML-aided High-throughput screening for Novel Oxide Perovskite Discovery.

Available for public release under LA-UR-21-27864.

Bio

Anjana Talapatra Anjana Talapatra is Director's Postdoctoral Fellow in the Materials Science in Radiation and Dynamics Extremes Group at Los Alamos National Laboratory. Her research involves combining high-throughput Density Functional Theory (DFT) and Machine-Learning methods to accelerate the search for new materials. Previously, she was employed as a Research Scientist at the Materials Science & Engineering Department at Texas A&M university after graduating with a Ph.D from the Mechanical Engineering Department in 2015. While at Texas A&M Anjana also worked on a wide variety of materials-related problems using CALPHAD techniques, DFT and Machine-Learning.

Sponsored by

Cite this work

Researchers should cite this work as follows:

  • Anjana Talapatra (2021), "A Machine Learning Aided Hierarchical Screening Strategy for Materials Discovery," https://nanohub.org/resources/35418.

    BibTex | EndNote

Time

Tags

A Machine Learning Aided Hierarchical Screening Strategy for Materials Discovery
  • A Machine Learning aided hierarchical screening strategy for materials discovery 1. A Machine Learning aided hiera… 0
    00:00/00:00
  • Discovery and Design of Novel Oxide Perovskite Scintillators 2. Discovery and Design of Novel … 115.81581581581582
    00:00/00:00
  • Objective 3. Objective 190.19019019019021
    00:00/00:00
  • A Strategy for Scintillator Discovery and Optimization 4. A Strategy for Scintillator Di… 242.87620954287621
    00:00/00:00
  • Objective 5. Objective 322.22222222222223
    00:00/00:00
  • Why Machine Learning? 6. Why Machine Learning? 335.73573573573577
    00:00/00:00
  • Machine Learning for Efficient Screening 7. Machine Learning for Efficient… 398.83216549883218
    00:00/00:00
  • Perovskite discovery using Machine Learning 8. Perovskite discovery using Mac… 460.79412746079413
    00:00/00:00
  • Components of ML infrastructure 9. Components of ML infrastructur… 546.04604604604606
    00:00/00:00
  • Components of ML infrastructure 10. Components of ML infrastructur… 591.59159159159162
    00:00/00:00
  • Components of ML infrastructure 11. Components of ML infrastructur… 727.660994327661
    00:00/00:00
  • Components of ML infrastructure 12. Components of ML infrastructur… 769.20253586920262
    00:00/00:00
  • Training data 13. Training data 836.1695028361695
    00:00/00:00
  • Features: Machine Learning models 14. Features: Machine Learning mod… 970.17017017017019
    00:00/00:00
  • Comparison of perovskite formability and stability 15. Comparison of perovskite forma… 1007.6743410076743
    00:00/00:00
  • Formability classification model 16. Formability classification mod… 1085.8191524858191
    00:00/00:00
  • Stability classification model 17. Stability classification model 1196.9302635969302
    00:00/00:00
  • Wide/narrow band gap classification model 18. Wide/narrow band gap classific… 1254.0540540540542
    00:00/00:00
  • Novel wide bandgap oxide perovskite predictions 19. Novel wide bandgap oxide perov… 2610.4771438104772
    00:00/00:00
  • Novel wide bandgap oxide perovskite predictions 20. Novel wide bandgap oxide perov… 2712.1121121121123
    00:00/00:00
  • Computational confirmation of results 21. Computational confirmation of … 2730.830830830831
    00:00/00:00
  • Some more suggestions 22. Some more suggestions 2756.5565565565566
    00:00/00:00