This tutorial will introduce core concepts of machine learning through the lens of a basic workflow to predict material bandgaps from material compositions. As we progress through this workflow, we’ll highlight key steps, challenges that can come up with materials data, and potential solutions to these challenges. The core workflow we’ll introduce includes: Data Cleaning, Feature Generation, Feature Engineering, Establishing Model Assessment, Training a Default Model, Hyperparameter Optimization, and Making Predictions. By the end of the workshop I hope that you’ll have a better understanding of these core concepts, and how they can all fit together.
To preview the materials ahead of time you can find them on the nanoHUB tool Machine Learning Lab Module.
Cite this work
Researchers should cite this work as follows: