Assessment of Outliers in Alloy Datasets Using Unsupervised Techniques
- National Energy Technology Lab. (NETL), Albany, OR (United States). Support Contractor
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- National Energy Technology Lab. (NETL), Albany, OR (United States)
Advancements in data analytics techniques have enabled complex, disparate datasets to be leveraged for alloy design. Identifying outliers in a dataset can reduce noise, identify erroneous and/or anomalous records, prevent overfitting, and improve model assessment and optimization. In this work, two alloy datasets (9-12% Cr ferritic-martensitic steels and austenitic stainless steels) have been assessed for outliers using unsupervised techniques supplemented with domain knowledge. Principal component analysis and k-means clustering were applied to the data, and points were assessed as outliers based on their distance from other points in the same cluster and from other points in the dataset. The outlier characteristics were investigated to determine both cluster-specific and overall trends in the properties of the outlier points. The approach demonstrated here is extensible to other alloy datasets for outlier identification and evaluation to improve the reliability of machine learning and modeling predictions for advanced alloy design.
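The abstract does not give implementation details, so the following is only a minimal sketch of the kind of workflow it describes: standardize the features, project them with principal component analysis, cluster with k-means, and flag points that lie unusually far from their cluster or from the dataset as a whole. Everything specific here is an assumption made for illustration, not the authors' method: the function names (`pca_project`, `kmeans`, `flag_outliers`), the use of centroid distance as a stand-in for "distance from other points", the mean-plus-three-standard-deviations cutoff, the choice of two components and two clusters, and the synthetic data.

```python
import numpy as np

def pca_project(X, n_components=2):
    """Standardize columns and project onto the leading principal components.
    Assumes no column is constant (nonzero standard deviation)."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    # Rows of Vt are the principal directions of the standardized data.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    return Z @ Vt[:n_components].T

def nearest(P, centers):
    """Index of the closest center for every row of P."""
    d = np.linalg.norm(P[:, None, :] - centers[None, :, :], axis=2)
    return d.argmin(axis=1)

def kmeans(P, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm with random initial centers (illustrative only)."""
    rng = np.random.default_rng(seed)
    centers = P[rng.choice(len(P), size=k, replace=False)]
    for _ in range(n_iter):
        labels = nearest(P, centers)
        # Keep the old center if a cluster happens to become empty.
        updated = np.array([
            P[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(updated, centers):
            break
        centers = updated
    return nearest(P, centers), centers

def flag_outliers(P, labels, centers, n_sigma=3.0):
    """Flag points far from their own cluster centroid or from the global centroid.
    The n_sigma cutoff is an assumed threshold, not one taken from the paper."""
    d_cluster = np.linalg.norm(P - centers[labels], axis=1)
    d_global = np.linalg.norm(P - P.mean(axis=0), axis=1)
    flags = d_global > d_global.mean() + n_sigma * d_global.std()
    for j in range(len(centers)):
        members = labels == j
        if members.sum() > 1:
            cutoff = d_cluster[members].mean() + n_sigma * d_cluster[members].std()
            flags[members] |= d_cluster[members] > cutoff
    return flags

# Synthetic stand-in for an alloy table: two tight groups plus one planted anomaly.
rng = np.random.default_rng(0)
group_a = rng.normal(0.0, 0.1, size=(30, 3))
group_b = rng.normal(0.0, 0.1, size=(30, 3)) + 1.0
anomaly = np.array([[10.0, -10.0, 10.0]])
X = np.vstack([group_a, group_b, anomaly])

P = pca_project(X, n_components=2)
labels, centers = kmeans(P, k=2)
flags = flag_outliers(P, labels, centers)
print("flagged row indices:", np.flatnonzero(flags))
```

In practice, any record flagged this way would still need review against domain knowledge, as the abstract notes, before being treated as erroneous.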
- Research Organization:
- National Energy Technology Laboratory (NETL), Pittsburgh, PA, Morgantown, WV, and Albany, OR (United States); Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE Office of Fossil Energy (FE)
- Grant/Contract Number:
- 89243318CFE000003; AC05-76RL01830
- OSTI ID:
- 1876555
- Report Number(s):
- PNNL-SA-169146
- Journal Information:
- JOM. Journal of the Minerals, Metals & Materials Society, Vol. 74, Issue 7; ISSN 1047-4838
- Publisher:
- Springer
- Country of Publication:
- United States
- Language:
- English