Structure classification and melting temperature prediction in octet AB solids via machine learning
Abstract
Machine learning methods are being increasingly used in condensed matter physics and materials science to classify crystals structures and predict material properties. However, the reliability of these methods for a given problem, especially when large data sets are unavailable, has not been well studied. By addressing the tasks of classifying crystal structure and predicting melting temperatures of the octet subset of AB solids, we performed such a study and found potential problems with using machine learning methods on relatively small data sets. At the same time, however, we can reaffirm the potential power of such methods for these tasks. In particular, we uncovered an important new material feature, the excess Born effective charge, that significantly increased the accuracy of the predictions for the classification problem we defined. This discovery leads us to propose a new scale for the degree of ionicity and covalency in these solids. More specifically, we partitioned the crystal structures of a set of 75 octet solids into those that are ionic and covalent bonded and thus performed a binary classification task. We found that using the standard indices (rσ,rπ), suggested by St. John and Bloch several decades ago, enabled an average success in classification of 92%.more »
- Authors:
-
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Publication Date:
- Research Org.:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- USDOE Laboratory Directed Research and Development (LDRD) Program
- OSTI Identifier:
- 1469521
- Alternate Identifier(s):
- OSTI ID: 1184701
- Report Number(s):
- LA-UR-14-28547
Journal ID: ISSN 1098-0121; PRBMDO
- Grant/Contract Number:
- AC52-06NA25396
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Physical Review. B, Condensed Matter and Materials Physics
- Additional Journal Information:
- Journal Volume: 91; Journal Issue: 21; Journal ID: ISSN 1098-0121
- Publisher:
- American Physical Society (APS)
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 36 MATERIALS SCIENCE; 97 MATHEMATICS AND COMPUTING; Information Science; Material Science
Citation Formats
Pilania, Ghanshyam, Gubernatis, James E., and Lookman, Turab. Structure classification and melting temperature prediction in octet AB solids via machine learning. United States: N. p., 2015.
Web. doi:10.1103/PhysRevB.91.214302.
Pilania, Ghanshyam, Gubernatis, James E., & Lookman, Turab. Structure classification and melting temperature prediction in octet AB solids via machine learning. United States. https://doi.org/10.1103/PhysRevB.91.214302
Pilania, Ghanshyam, Gubernatis, James E., and Lookman, Turab. Mon .
"Structure classification and melting temperature prediction in octet AB solids via machine learning". United States. https://doi.org/10.1103/PhysRevB.91.214302. https://www.osti.gov/servlets/purl/1469521.
@article{osti_1469521,
title = {Structure classification and melting temperature prediction in octet AB solids via machine learning},
author = {Pilania, Ghanshyam and Gubernatis, James E. and Lookman, Turab},
abstractNote = {Machine learning methods are being increasingly used in condensed matter physics and materials science to classify crystals structures and predict material properties. However, the reliability of these methods for a given problem, especially when large data sets are unavailable, has not been well studied. By addressing the tasks of classifying crystal structure and predicting melting temperatures of the octet subset of AB solids, we performed such a study and found potential problems with using machine learning methods on relatively small data sets. At the same time, however, we can reaffirm the potential power of such methods for these tasks. In particular, we uncovered an important new material feature, the excess Born effective charge, that significantly increased the accuracy of the predictions for the classification problem we defined. This discovery leads us to propose a new scale for the degree of ionicity and covalency in these solids. More specifically, we partitioned the crystal structures of a set of 75 octet solids into those that are ionic and covalent bonded and thus performed a binary classification task. We found that using the standard indices (rσ,rπ), suggested by St. John and Bloch several decades ago, enabled an average success in classification of 92%. Using just rσ and the excess Born effective charge ΔZA of the A atom enabled an average success of 97%, but we also found relatively large variations about these averages that were dependent on how certain machine learning methods were used and for which a standard deviation was not a proper measure of the degree of confidence we can place in either average. Instead, we calculated and report with 95 % confidence that the traditional classification pair predicts an accuracy in the interval [ 89%, 95%] and the accuracy of the new pair lies in the interval [96 %, 99%]. For melting temperature predictions, the size of our data set was 46. We estimate the root-mean-squared error of our resulting model to be 11% of the mean melting temperature of the data, but we note that if the accuracy of this predicted error is itself measured, our estimated fitting error itself has a root-mean-square error of 50%. In short, what we illustrate is that classification and regression predictions can vary significantly, depending on the details of how machine learning methods are applied to small data sets. This variation makes it important, if not essential, to average the predictions and compute confidence intervals about these averages to report results meaningfully. However, when properly used, these statistical methods can advance our understanding and improve predictions of material properties even for small data sets.},
doi = {10.1103/PhysRevB.91.214302},
journal = {Physical Review. B, Condensed Matter and Materials Physics},
number = 21,
volume = 91,
place = {United States},
year = {2015},
month = {6}
}
Web of Science
Works referenced in this record:
Pauli-Force Model Potential for Solids
journal, March 1973
- Simons, Gary; Bloch, Aaron N.
- Physical Review B, Vol. 7, Issue 6
Electron localization: Band-by-band decomposition and application to oxides
journal, December 2002
- Veithen, M.; Gonze, X.; Ghosez, Ph.
- Physical Review B, Vol. 66, Issue 23
Quantum-Defect Electronegativity Scale for Nontransition Elements
journal, October 1974
- John, Judith; Bloch, Aaron N.
- Physical Review Letters, Vol. 33, Issue 18
Dielectric Classification of Crystal Structures, Ionization Potentials, and Band Structures
journal, April 1969
- Phillips, J. C.; Van Vechten, J. A.
- Physical Review Letters, Vol. 22, Issue 14
Non-nominal value of the dynamical effective charge in alkaline-earth oxides
journal, June 1997
- Posternak, M.; Baldereschi, A.; Krakauer, H.
- Physical Review B, Vol. 55, Issue 24
Debye temperature and melting point of II-VI and III-V semiconductors
journal, July 2010
- Kumar, V.; Jha, V.; Shrivastava, A. K.
- Crystal Research and Technology, Vol. 45, Issue 9
Quantum-defect theory of heats of formation and structural transition energies of liquid and solid simple metal alloys and compounds
journal, March 1978
- Chelikowsky, J. R.; Phillips, J. C.
- Physical Review B, Vol. 17, Issue 6
Projector augmented-wave method
journal, December 1994
- Blöchl, P. E.
- Physical Review B, Vol. 50, Issue 24, p. 17953-17979
A chemical scale for crystal-structure maps
journal, July 1984
- Pettifor, D. G.
- Solid State Communications, Vol. 51, Issue 1
Phonons and related crystal properties from density-functional perturbation theory
journal, July 2001
- Baroni, Stefano; de Gironcoli, Stefano; Dal Corso, Andrea
- Reviews of Modern Physics, Vol. 73, Issue 2
Bond-Orbital Model and the Properties of Tetrahedrally Coordinated Solids
journal, November 1973
- Harrison, Walter A.
- Physical Review B, Vol. 8, Issue 10
Data mining for materials: Computational experiments with compounds
journal, March 2012
- Saad, Yousef; Gao, Da; Ngo, Thanh
- Physical Review B, Vol. 85, Issue 10
Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer
journal, April 2006
- Ein-Dor, L.; Zuk, O.; Domany, E.
- Proceedings of the National Academy of Sciences, Vol. 103, Issue 15
Fads and fallacies in the name of small-sample microarray classification - A highlight of misunderstanding and erroneous usage in the applications of genomic signal processing
journal, January 2007
- Braga-Neto, U.
- IEEE Signal Processing Magazine, Vol. 24, Issue 1
Systematization of the stable crystal structure of all -type binary compounds: A pseudopotential orbital-radii approach
journal, December 1980
- Zunger, Alex
- Physical Review B, Vol. 22, Issue 12
Special points for Brillouin-zone integrations
journal, June 1976
- Monkhorst, Hendrik J.; Pack, James D.
- Physical Review B, Vol. 13, Issue 12, p. 5188-5192
Small Sample Issues for Microarray-Based Classification
journal, January 2001
- Dougherty, Edward R.
- Comparative and Functional Genomics, Vol. 2, Issue 1
Quantum Dielectric Theory of Electronegativity in Covalent Systems. I. Electronic Dielectric Constant
journal, June 1969
- Van Vechten, J. A.
- Physical Review, Vol. 182, Issue 3
Semiconductor effective charges from tight-binding theory
journal, June 1996
- Bennetto, J.; Vanderbilt, David
- Physical Review B, Vol. 53, Issue 23
On the crystal chemistry of normal valence compounds
journal, December 1959
- Mooser, E.; Pearson, W. B.
- Acta Crystallographica, Vol. 12, Issue 12
Role of tight-binding parameters and scaling laws on effective charges in semiconductors
journal, February 2000
- Iessi, U.; Parisi, C.; Bernasconi, M.
- Physical Review B, Vol. 61, Issue 7
Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set
journal, October 1996
- Kresse, G.; Furthmüller, J.
- Physical Review B, Vol. 54, Issue 16, p. 11169-11186
Performance of Error Estimators for Classification
journal, March 2010
- Dougherty, Edward; Sima, Chao; Hua, >
- Current Bioinformatics, Vol. 5, Issue 1
The Illusion of Distribution-Free Small-Sample Classification in Genomics
journal, August 2011
- R. Dougherty, Edward; Zollanvari, Amin; M. Braga-Neto, Ulisses
- Current Genomics, Vol. 12, Issue 5
Melting point trends in intermetallic alloys
journal, January 1987
- Chelikowsky, James R.; Anderson, Karen E.
- Journal of Physics and Chemistry of Solids, Vol. 48, Issue 2
Localized Effective Charges in Diatomic Crystals
journal, August 1971
- Lucovsky, G.; Martin, Richard M.; Burstein, E.
- Physical Review B, Vol. 4, Issue 4
Scaling Theory of Melting Temperatures of Covalent Crystals
journal, September 1972
- Van Vechten, J. A.
- Physical Review Letters, Vol. 29, Issue 12
Big Data of Materials Science: Critical Role of the Descriptor
journal, March 2015
- Ghiringhelli, Luca M.; Vybiral, Jan; Levchenko, Sergey V.
- Physical Review Letters, Vol. 114, Issue 10
The infrared effective charge in IV-VI compounds. I. A simple one-dimensional model
journal, November 1979
- Littlewood, P. B.; Heine, V.
- Journal of Physics C: Solid State Physics, Vol. 12, Issue 21
The melting points of intermetallic compounds
journal, March 1986
- Chelikowsky, J. R.; Anderson, K. E.
- Physics Letters A, Vol. 114, Issue 8-9
Ground State of the Electron Gas by a Stochastic Method
journal, August 1980
- Ceperley, D. M.; Alder, B. J.
- Physical Review Letters, Vol. 45, Issue 7, p. 566-569
Dynamical matrices, Born effective charges, dielectric permittivity tensors, and interatomic force constants from density-functional perturbation theory
journal, April 1997
- Gonze, Xavier; Lee, Changyol
- Physical Review B, Vol. 55, Issue 16
Search for regularities in the melting points of AB compounds
journal, June 1993
- Hulliger, F.; Villars, P.
- Journal of Alloys and Compounds, Vol. 197, Issue 2
Theory of polarization of crystalline solids
journal, January 1993
- King-Smith, R. D.; Vanderbilt, David
- Physical Review B, Vol. 47, Issue 3
Polarization and dynamical charge of ZnO within different one-particle schemes
journal, December 1995
- Massidda, S.; Resta, R.; Posternak, M.
- Physical Review B, Vol. 52, Issue 24
Structural Stability of 495 Binary Compounds
journal, March 1980
- Zunger, Alex
- Physical Review Letters, Vol. 44, Issue 9
Dynamical atomic charges: The case of compounds
journal, September 1998
- Ghosez, Ph.; Michenaud, J. -P.; Gonze, X.
- Physical Review B, Vol. 58, Issue 10
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids
journal, February 2014
- Seko, Atsuto; Maekawa, Tomoya; Tsuda, Koji
- Physical Review B, Vol. 89, Issue 5
Semiconductor effective charges and dielectric constants in the tight-binding approach
journal, November 1997
- Di Ventra, Massimiliano; Fernández, Pablo
- Physical Review B, Vol. 56, Issue 20
Electronic structure of AlFeN films exhibiting crystallographic orientation change from c- to a-axis with Fe concentrations and annealing effect
journal, February 2020
- Tatemizo, Nobuyuki; Imada, Saki; Okahara, Kizuna
- Scientific Reports, Vol. 10, Issue 1
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single and binary component solids
text, January 2013
- Seko, Atsuto; Maekawa, Tomoya; Tsuda, Koji
- arXiv
Big Data of Materials Science - Critical Role of the Descriptor
text, January 2014
- Ghiringhelli, Luca M.; Vybiral, Jan; Levchenko, Sergey V.
- arXiv
Semiconductor effective charges from tight-binding theory
text, January 1996
- Bennetto, J.; Vanderbilt, David
- arXiv
Works referencing / citing this record:
Machine learning properties of binary wurtzite superlattices
journal, January 2018
- Pilania, G.; Liu, X. -Y.
- Journal of Materials Science, Vol. 53, Issue 9
Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning
journal, December 2016
- Medasani, Bharat; Gamst, Anthony; Ding, Hong
- npj Computational Materials, Vol. 2, Issue 1
Machine learning in materials informatics: recent applications and prospects
journal, December 2017
- Ramprasad, Rampi; Batra, Rohit; Pilania, Ghanshyam
- npj Computational Materials, Vol. 3, Issue 1
Identifying Pb-free perovskites for solar cells by machine learning
journal, March 2019
- Im, Jino; Lee, Seongwon; Ko, Tae-Wook
- npj Computational Materials, Vol. 5, Issue 1
Recent advances and applications of machine learning in solid-state materials science
journal, August 2019
- Schmidt, Jonathan; Marques, Mário R. G.; Botti, Silvana
- npj Computational Materials, Vol. 5, Issue 1
Machine learning bandgaps of double perovskites
journal, January 2016
- Pilania, G.; Mannodi-Kanakkithodi, A.; Uberuaga, B. P.
- Scientific Reports, Vol. 6, Issue 1
Accelerated search for BaTiO 3 -based piezoelectrics with vertical morphotropic phase boundary using Bayesian learning
journal, November 2016
- Xue, Dezhen; Balachandran, Prasanna V.; Yuan, Ruihao
- Proceedings of the National Academy of Sciences, Vol. 113, Issue 47
Uncovering structure-property relationships of materials by subgroup discovery
journal, January 2017
- Goldsmith, Bryan R.; Boley, Mario; Vreeken, Jilles
- New Journal of Physics, Vol. 19, Issue 1
Discovering phase transitions with unsupervised learning
journal, November 2016
- Wang, Lei
- Physical Review B, Vol. 94, Issue 19
Finding New Perovskite Halides via Machine Learning
journal, April 2016
- Pilania, Ghanshyam; Balachandran, Prasanna V.; Kim, Chiho
- Frontiers in Materials, Vol. 3