Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning

Research output: Contribution to journal › Article › Research › peer-review

Standard

Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning. / Terbuch, Anika; O'Leary, Paul; Khalilimotlaghkasmaei, Negin et al.
In: IEEE Transactions on Instrumentation and Measurement, Vol. 72, 2503711, 12.01.2023.

BibTeX

@article{5a2b1a10fb7d40e9ae075cd226e15181,
title = "Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning",
abstract = "This article investigates the use of hybrid machine learning (HML) for the detection of anomalous multivariate time-series (MVTS). Focusing on a specific industrial use-case from geotechnical engineering, where hundreds of MVTS need to be analyzed and classified, has permitted extensive testing of the proposed methods with real measurement data. The novel hybrid anomaly detector combines two means for detection, creating redundancy and reducing the risk of missing defective elements in a safety relevant application. The two parts are: 1) anomaly detection based on approximately 50 physics-motivated key performance indicators (KPIs) and 2) an unsupervised variational autoencoder (VAE) with long short-term memory layers. The KPI captures expert knowledge on the properties of the data that infer the quality of produced elements; these are used as a type of auto-labeling. The goal of the extension using machine learning (ML) is to detect anomalies that the experts may not have foreseen. In contrast to anomaly detection in streaming data, where the goal is to locate an anomaly, each MVTS is complete in itself at the time of evaluation and is categorized as anomalous or nonanomalous. The article compares the performance of different VAE architectures [e.g., long short-term memory (LSTM-VAE) and bidirectional LSTM (BiLSTM-VAE)]. The results of using a genetic algorithm to optimize the hyperparameters of the different architectures are also presented. It is shown that modeling the industrial process as an assemblage of subprocesses yields a better discriminating power and permits the identification of interdependencies between the subprocesses. Interestingly, different autoencoder architectures may be optimal for different subprocesses; here two different architectures are combined to achieve superior performance. Extensive results are presented based on a very large set of real-time measurement data.",
author = "Anika Terbuch and Paul O'Leary and Negin Khalilimotlaghkasmaei and Peter Auer and Alexander Z{\"o}hrer and Vincent Winter",
note = "Publisher Copyright: {\textcopyright} 1963-2012 IEEE.",
year = "2023",
month = jan,
day = "12",
doi = "10.1109/TIM.2023.3236354",
language = "English",
volume = "72",
journal = "IEEE Transactions on Instrumentation and Measurement",
issn = "0018-9456",
publisher = "Institute of Electrical and Electronics Engineers",
}

RIS (suitable for import to EndNote)

TY - JOUR

T1 - Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning

AU - Terbuch, Anika

AU - O'Leary, Paul

AU - Khalilimotlaghkasmaei, Negin

AU - Auer, Peter

AU - Zöhrer, Alexander

AU - Winter, Vincent

N1 - Publisher Copyright: © 1963-2012 IEEE.

PY - 2023/1/12

Y1 - 2023/1/12

AB - This article investigates the use of hybrid machine learning (HML) for the detection of anomalous multivariate time-series (MVTS). Focusing on a specific industrial use-case from geotechnical engineering, where hundreds of MVTS need to be analyzed and classified, has permitted extensive testing of the proposed methods with real measurement data. The novel hybrid anomaly detector combines two means for detection, creating redundancy and reducing the risk of missing defective elements in a safety relevant application. The two parts are: 1) anomaly detection based on approximately 50 physics-motivated key performance indicators (KPIs) and 2) an unsupervised variational autoencoder (VAE) with long short-term memory layers. The KPI captures expert knowledge on the properties of the data that infer the quality of produced elements; these are used as a type of auto-labeling. The goal of the extension using machine learning (ML) is to detect anomalies that the experts may not have foreseen. In contrast to anomaly detection in streaming data, where the goal is to locate an anomaly, each MVTS is complete in itself at the time of evaluation and is categorized as anomalous or nonanomalous. The article compares the performance of different VAE architectures [e.g., long short-term memory (LSTM-VAE) and bidirectional LSTM (BiLSTM-VAE)]. The results of using a genetic algorithm to optimize the hyperparameters of the different architectures are also presented. It is shown that modeling the industrial process as an assemblage of subprocesses yields a better discriminating power and permits the identification of interdependencies between the subprocesses. Interestingly, different autoencoder architectures may be optimal for different subprocesses; here two different architectures are combined to achieve superior performance. Extensive results are presented based on a very large set of real-time measurement data.

UR - http://www.scopus.com/inward/record.url?scp=85147286411&partnerID=8YFLogxK

U2 - 10.1109/TIM.2023.3236354

DO - 10.1109/TIM.2023.3236354

M3 - Article

VL - 72

JO - IEEE Transactions on Instrumentation and Measurement

JF - IEEE Transactions on Instrumentation and Measurement

SN - 0018-9456

M1 - 2503711

ER -