This document introduces basic concepts of data science and machine learning in the context of materials science applications. The focus is on hands-on activities where readers use open, online tools in nanoHUB to explore the concepts being introduced. Topics covered include querying data resources and data organization and preparation. Regression exercises, including neural networks, to predict materials properties from a set of descriptors and a classification exercise designed to predict the crystal structure of elemental metals. The examples are provided as fully contained Jupyter notebooks and state of the art, open software. They are designed to introduce to the main steps in data science workflows and readers can modify or extend them to solve additional problems.
Cite this work
Researchers should cite this work as follows: