Issue 5, 2018

Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery

Abstract

Traditional machine learning (ML) metrics overestimate model performance for materials discovery. We introduce (1) leave-one-cluster-out cross-validation (LOCO CV) and (2) a simple nearest-neighbor benchmark to show that model performance in discovery applications strongly depends on the problem, data sampling, and extrapolation. Our results suggest that ML-guided iterative experimentation may outperform standard high-throughput screening for discovering breakthrough materials like high-Tc superconductors with ML.

Graphical abstract: Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery

Article information

Article type
Communication
Submitted
05 Mar 2018
Accepted
11 Jul 2018
First published
17 Aug 2018
This article is Open Access
Creative Commons BY license

Mol. Syst. Des. Eng., 2018,3, 819-825

Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery

B. Meredig, E. Antono, C. Church, M. Hutchinson, J. Ling, S. Paradiso, B. Blaiszik, I. Foster, B. Gibbons, J. Hattrick-Simpers, A. Mehta and L. Ward, Mol. Syst. Des. Eng., 2018, 3, 819 DOI: 10.1039/C8ME00012C

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements