Tensile strength prediction of steel sheets: An insight into data-driven models, dimensionality reduction, and feature importance
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
Standard
in: Modelling and simulation in materials science and engineering, Jahrgang 32.2024, Nr. 7, 075004, 24.08.2024.
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
Harvard
APA
Vancouver
Author
Bibtex - Download
}
RIS (suitable for import to EndNote) - Download
TY - JOUR
T1 - Tensile strength prediction of steel sheets
T2 - An insight into data-driven models, dimensionality reduction, and feature importance
AU - Millner, Gerfried
AU - Mücke, Manfred
AU - Romaner, Lorenz
AU - Scheiber, Daniel
PY - 2024/8/24
Y1 - 2024/8/24
N2 - In this work we apply data-driven models for predicting tensile strength of steel coils from chemical composition and process parameters. The data originates from steel production and includes a full chemical analysis, as well as many process parameters and the resulting strength properties from tensile tests. We establish a data pre-processing pipeline, where we apply data cleaning and feature engineering to create a machine-readable dataset suitable for various modeling tasks. We compare prediction quality, complexity and interpretability of pure machine learning (ML) models, either with the full feature set or a reduced one. Dimensionality reduction methods are used to reduce the number of features and therefore reduce complexity, either with a smart selection method or feature encoding, where features are combined and the included information is preserved. In order to determine key features of our models, we are investigating feature importance ratings, which can be used as a feature selection criteria. Furthermore, we are highlighting methods to explain predictions and determine the impact of every feature in every observation applicable for any ML model.
AB - In this work we apply data-driven models for predicting tensile strength of steel coils from chemical composition and process parameters. The data originates from steel production and includes a full chemical analysis, as well as many process parameters and the resulting strength properties from tensile tests. We establish a data pre-processing pipeline, where we apply data cleaning and feature engineering to create a machine-readable dataset suitable for various modeling tasks. We compare prediction quality, complexity and interpretability of pure machine learning (ML) models, either with the full feature set or a reduced one. Dimensionality reduction methods are used to reduce the number of features and therefore reduce complexity, either with a smart selection method or feature encoding, where features are combined and the included information is preserved. In order to determine key features of our models, we are investigating feature importance ratings, which can be used as a feature selection criteria. Furthermore, we are highlighting methods to explain predictions and determine the impact of every feature in every observation applicable for any ML model.
U2 - 10.1088/1361-651X/ad6fc0
DO - 10.1088/1361-651X/ad6fc0
M3 - Article
VL - 32.2024
JO - Modelling and simulation in materials science and engineering
JF - Modelling and simulation in materials science and engineering
SN - 0965-0393
IS - 7
M1 - 075004
ER -