DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Structure classification and melting temperature prediction in octet AB solids via machine learning

Journal Article · · Physical Review. B, Condensed Matter and Materials Physics

Machine learning methods are being increasingly used in condensed matter physics and materials science to classify crystals structures and predict material properties. However, the reliability of these methods for a given problem, especially when large data sets are unavailable, has not been well studied. By addressing the tasks of classifying crystal structure and predicting melting temperatures of the octet subset of AB solids, we performed such a study and found potential problems with using machine learning methods on relatively small data sets. At the same time, however, we can reaffirm the potential power of such methods for these tasks. In particular, we uncovered an important new material feature, the excess Born effective charge, that significantly increased the accuracy of the predictions for the classification problem we defined. This discovery leads us to propose a new scale for the degree of ionicity and covalency in these solids. More specifically, we partitioned the crystal structures of a set of 75 octet solids into those that are ionic and covalent bonded and thus performed a binary classification task. We found that using the standard indices (rσ,rπ), suggested by St. John and Bloch several decades ago, enabled an average success in classification of 92%. Using just rσ and the excess Born effective charge ΔZA of the A atom enabled an average success of 97%, but we also found relatively large variations about these averages that were dependent on how certain machine learning methods were used and for which a standard deviation was not a proper measure of the degree of confidence we can place in either average. Instead, we calculated and report with 95 % confidence that the traditional classification pair predicts an accuracy in the interval [ 89%, 95%] and the accuracy of the new pair lies in the interval [96 %, 99%]. For melting temperature predictions, the size of our data set was 46. We estimate the root-mean-squared error of our resulting model to be 11% of the mean melting temperature of the data, but we note that if the accuracy of this predicted error is itself measured, our estimated fitting error itself has a root-mean-square error of 50%. In short, what we illustrate is that classification and regression predictions can vary significantly, depending on the details of how machine learning methods are applied to small data sets. This variation makes it important, if not essential, to average the predictions and compute confidence intervals about these averages to report results meaningfully. However, when properly used, these statistical methods can advance our understanding and improve predictions of material properties even for small data sets.

Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC52-06NA25396
OSTI ID:
1469521
Alternate ID(s):
OSTI ID: 1184701
Report Number(s):
LA-UR-14-28547; PRBMDO
Journal Information:
Physical Review. B, Condensed Matter and Materials Physics, Vol. 91, Issue 21; ISSN 1098-0121
Publisher:
American Physical Society (APS)Copyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 51 works
Citation information provided by
Web of Science

References (52)

Pauli-Force Model Potential for Solids journal March 1973
Electron localization: Band-by-band decomposition and application to oxides journal December 2002
Quantum-Defect Electronegativity Scale for Nontransition Elements journal October 1974
Dielectric Classification of Crystal Structures, Ionization Potentials, and Band Structures journal April 1969
Non-nominal value of the dynamical effective charge in alkaline-earth oxides journal June 1997
Debye temperature and melting point of II-VI and III-V semiconductors journal July 2010
Quantum-defect theory of heats of formation and structural transition energies of liquid and solid simple metal alloys and compounds journal March 1978
Projector augmented-wave method journal December 1994
A chemical scale for crystal-structure maps journal July 1984
Phonons and related crystal properties from density-functional perturbation theory journal July 2001
Bond-Orbital Model and the Properties of Tetrahedrally Coordinated Solids journal November 1973
Data mining for materials: Computational experiments with A B compounds journal March 2012
Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer journal April 2006
Fads and fallacies in the name of small-sample microarray classification - A highlight of misunderstanding and erroneous usage in the applications of genomic signal processing journal January 2007
Systematization of the stable crystal structure of all AB -type binary compounds: A pseudopotential orbital-radii approach journal December 1980
Special points for Brillouin-zone integrations journal June 1976
Small Sample Issues for Microarray-Based Classification journal January 2001
Quantum Dielectric Theory of Electronegativity in Covalent Systems. I. Electronic Dielectric Constant journal June 1969
Semiconductor effective charges from tight-binding theory journal June 1996
On the crystal chemistry of normal valence compounds journal December 1959
Role of tight-binding parameters and scaling laws on effective charges in semiconductors journal February 2000
Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set journal October 1996
Performance of Error Estimators for Classification journal March 2010
The Illusion of Distribution-Free Small-Sample Classification in Genomics journal August 2011
Melting point trends in intermetallic alloys journal January 1987
Ionicity of the Chemical Bond in Crystals journal July 1970
Electric polarization as a bulk quantity and its relation to surface charge journal August 1993
Localized Effective Charges in Diatomic Crystals journal August 1971
Scaling Theory of Melting Temperatures of Covalent Crystals journal September 1972
Big Data of Materials Science: Critical Role of the Descriptor journal March 2015
The infrared effective charge in IV-VI compounds. I. A simple one-dimensional model journal November 1979
The melting points of intermetallic compounds journal March 1986
Ground State of the Electron Gas by a Stochastic Method journal August 1980
Dynamical matrices, Born effective charges, dielectric permittivity tensors, and interatomic force constants from density-functional perturbation theory journal April 1997
Electronic Structure book January 2004
Search for regularities in the melting points of AB compounds journal June 1993
Theory of polarization of crystalline solids journal January 1993
Polarization and dynamical charge of ZnO within different one-particle schemes journal December 1995
Structural Stability of 495 Binary Compounds journal March 1980
Dynamical atomic charges: The case of AB O 3 compounds journal September 1998
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids journal February 2014
Macroscopic polarization in crystalline dielectrics: the geometric phase approach journal July 1994
Semiconductor effective charges and dielectric constants in the tight-binding approach journal November 1997
Piezoelectricity journal February 1972
Statistics, Data Mining, and Machine Learning in Astronomy book January 2014
Electronic structure of AlFeN films exhibiting crystallographic orientation change from c- to a-axis with Fe concentrations and annealing effect journal February 2020
Statistics, Data Mining, and Machine Learning in Astronomy journal December 2019
Statistics, Data Mining, and Machine Learning in Astronomy book December 2014
Electronic Structure book September 2020
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single and binary component solids text January 2013
Big Data of Materials Science - Critical Role of the Descriptor text January 2014
Semiconductor effective charges from tight-binding theory text January 1996

Cited By (12)

A Perspective on Materials Informatics: State-of-the-Art and Challenges book December 2015
Machine learning properties of binary wurtzite superlattices journal January 2018
Predicting defect behavior in B2 intermetallics by merging ab initio modeling and machine learning journal December 2016
Machine learning in materials informatics: recent applications and prospects journal December 2017
Identifying Pb-free perovskites for solar cells by machine learning journal March 2019
Recent advances and applications of machine learning in solid-state materials science journal August 2019
Machine learning bandgaps of double perovskites journal January 2016
Accelerated search for BaTiO 3 -based piezoelectrics with vertical morphotropic phase boundary using Bayesian learning journal November 2016
Uncovering structure-property relationships of materials by subgroup discovery journal January 2017
Predictions of new AB O 3 perovskite compounds by combining machine learning and density functional theory journal April 2018
Finding New Perovskite Halides via Machine Learning journal April 2016
Discovering Phase Transitions with Unsupervised Learning text January 2016