Tensile strength prediction of steel sheets: An insight into data-driven models, dimensionality reduction, and feature importance

Publikationen: Beitrag in FachzeitschriftArtikelForschung(peer-reviewed)

Standard

Tensile strength prediction of steel sheets: An insight into data-driven models, dimensionality reduction, and feature importance. / Millner, Gerfried; Mücke, Manfred; Romaner, Lorenz et al.
in: Modelling and simulation in materials science and engineering, Jahrgang 32.2024, Nr. 7, 075004, 24.08.2024.

Publikationen: Beitrag in FachzeitschriftArtikelForschung(peer-reviewed)

Bibtex - Download

@article{4c3b02c94fe34b99ada269ef4d556853,
title = "Tensile strength prediction of steel sheets: An insight into data-driven models, dimensionality reduction, and feature importance",
abstract = "In this work we apply data-driven models for predicting tensile strength of steel coils from chemical composition and process parameters. The data originates from steel production and includes a full chemical analysis, as well as many process parameters and the resulting strength properties from tensile tests. We establish a data pre-processing pipeline, where we apply data cleaning and feature engineering to create a machine-readable dataset suitable for various modeling tasks. We compare prediction quality, complexity and interpretability of pure machine learning (ML) models, either with the full feature set or a reduced one. Dimensionality reduction methods are used to reduce the number of features and therefore reduce complexity, either with a smart selection method or feature encoding, where features are combined and the included information is preserved. In order to determine key features of our models, we are investigating feature importance ratings, which can be used as a feature selection criteria. Furthermore, we are highlighting methods to explain predictions and determine the impact of every feature in every observation applicable for any ML model.",
author = "Gerfried Millner and Manfred M{\"u}cke and Lorenz Romaner and Daniel Scheiber",
year = "2024",
month = aug,
day = "24",
doi = "10.1088/1361-651X/ad6fc0",
language = "English",
volume = "32.2024",
journal = "Modelling and simulation in materials science and engineering",
issn = "0965-0393",
publisher = "IOP Publishing Ltd.",
number = "7",

}

RIS (suitable for import to EndNote) - Download

TY - JOUR

T1 - Tensile strength prediction of steel sheets

T2 - An insight into data-driven models, dimensionality reduction, and feature importance

AU - Millner, Gerfried

AU - Mücke, Manfred

AU - Romaner, Lorenz

AU - Scheiber, Daniel

PY - 2024/8/24

Y1 - 2024/8/24

N2 - In this work we apply data-driven models for predicting tensile strength of steel coils from chemical composition and process parameters. The data originates from steel production and includes a full chemical analysis, as well as many process parameters and the resulting strength properties from tensile tests. We establish a data pre-processing pipeline, where we apply data cleaning and feature engineering to create a machine-readable dataset suitable for various modeling tasks. We compare prediction quality, complexity and interpretability of pure machine learning (ML) models, either with the full feature set or a reduced one. Dimensionality reduction methods are used to reduce the number of features and therefore reduce complexity, either with a smart selection method or feature encoding, where features are combined and the included information is preserved. In order to determine key features of our models, we are investigating feature importance ratings, which can be used as a feature selection criteria. Furthermore, we are highlighting methods to explain predictions and determine the impact of every feature in every observation applicable for any ML model.

AB - In this work we apply data-driven models for predicting tensile strength of steel coils from chemical composition and process parameters. The data originates from steel production and includes a full chemical analysis, as well as many process parameters and the resulting strength properties from tensile tests. We establish a data pre-processing pipeline, where we apply data cleaning and feature engineering to create a machine-readable dataset suitable for various modeling tasks. We compare prediction quality, complexity and interpretability of pure machine learning (ML) models, either with the full feature set or a reduced one. Dimensionality reduction methods are used to reduce the number of features and therefore reduce complexity, either with a smart selection method or feature encoding, where features are combined and the included information is preserved. In order to determine key features of our models, we are investigating feature importance ratings, which can be used as a feature selection criteria. Furthermore, we are highlighting methods to explain predictions and determine the impact of every feature in every observation applicable for any ML model.

U2 - 10.1088/1361-651X/ad6fc0

DO - 10.1088/1361-651X/ad6fc0

M3 - Article

VL - 32.2024

JO - Modelling and simulation in materials science and engineering

JF - Modelling and simulation in materials science and engineering

SN - 0965-0393

IS - 7

M1 - 075004

ER -