Machine learning concepts in predictive analytics - A case study on wind turbine data

Research output: ThesisMaster's Thesis

Bibtex - Download

@mastersthesis{98a053eecdf94174be512301a77d9a2d,
title = "Machine learning concepts in predictive analytics - A case study on wind turbine data",
abstract = "Due to current structural change in energy systems (energy transition), the construction of wind turbines in order to provide a sustainable production of energy rose to prominence in the last couple of years. While investment in renewable energy has been supported by governments in one way or the other in the past, ceased subsidy may strongly influence profitability in the future. The costs of construction as well as for operations and maintenance (O&M) can be significant. Unscheduled maintenance, often caused by breakdown of the turbine, has been identified as a major part of overall (O&M) costs, owing to increased logistical expenditure and lost revenue. As a consequence condition monitoring systems (CMS) more and more have manifested themselves to measure behavior of the turbine or individual components in order to provide diagnostic information. This in turn facilitates the scheduling of repair work as well as allows the prediction and consecutive avoidance of component failures that may lead to a breakdown. CMS can have access to various data sources in form of time series, that include vibration, voltage levels or performance related information. Common CMS approaches and related research likewise are often focused on anomaly detection in time series by applying physical models and various transformations on the data (e.g. Wavelet transformation). The usage of machine learning algorithms for prediction lacks a more thorough investigation. The subject of this thesis is to contribute to the closure of this gap by applying two different machine learning algorithms to performance related data of a wind turbine in order to detect anomalies and predict events in defined future time intervals. Despite the technical nature of this specific case, we aim to present the applied approach as valuable procedure for any time series prediction as often occurring in the domain of predictive analytics. Besides physical time series data also sequential event data were provided. Because of it's proclaimed applicability for modeling time series, a restricted Boltzmann machine (RBM) with Gaussian visible units has been used. As it is a generative model the trained RBM can estimate the probability of a given set of input variables, that are the sensor values at a specific point in time. In case the probability falls below a threshold the occurrence of an event is predicted. The second applied algorithm was a support vector machine (SVM), a binary linear classifier which is trained through supervised learning. To examine the performances of the methods for the respective time interval a receiver operating characteristic (ROC) was used. Thereby not only false event predictions but also the number of missed events could be investigated. The results substantiate the approach to be valuable for data analysis, although further improvements are possible. Furthermore, the derived predictions can be enhanced by decision prescriptions in the sense of prescriptive analytics. A link to entrepreneurial circumstances could enlarge practical benefits.",
keywords = "maschinelles Lernen, Windkraftanlage, Condition Monitoring System, Support Vector Maschine, Gaussian-Bernoulli-RBM, Predictive Analytics, machine learning, wind turbine, CMS, SVM, Gaussian-Bernoulli-RBM, predictive analytics",
author = "Elmar Steiner",
note = "embargoed until 28-10-2019",
year = "2017",
language = "English",

}

RIS (suitable for import to EndNote) - Download

TY - THES

T1 - Machine learning concepts in predictive analytics - A case study on wind turbine data

AU - Steiner, Elmar

N1 - embargoed until 28-10-2019

PY - 2017

Y1 - 2017

N2 - Due to current structural change in energy systems (energy transition), the construction of wind turbines in order to provide a sustainable production of energy rose to prominence in the last couple of years. While investment in renewable energy has been supported by governments in one way or the other in the past, ceased subsidy may strongly influence profitability in the future. The costs of construction as well as for operations and maintenance (O&M) can be significant. Unscheduled maintenance, often caused by breakdown of the turbine, has been identified as a major part of overall (O&M) costs, owing to increased logistical expenditure and lost revenue. As a consequence condition monitoring systems (CMS) more and more have manifested themselves to measure behavior of the turbine or individual components in order to provide diagnostic information. This in turn facilitates the scheduling of repair work as well as allows the prediction and consecutive avoidance of component failures that may lead to a breakdown. CMS can have access to various data sources in form of time series, that include vibration, voltage levels or performance related information. Common CMS approaches and related research likewise are often focused on anomaly detection in time series by applying physical models and various transformations on the data (e.g. Wavelet transformation). The usage of machine learning algorithms for prediction lacks a more thorough investigation. The subject of this thesis is to contribute to the closure of this gap by applying two different machine learning algorithms to performance related data of a wind turbine in order to detect anomalies and predict events in defined future time intervals. Despite the technical nature of this specific case, we aim to present the applied approach as valuable procedure for any time series prediction as often occurring in the domain of predictive analytics. Besides physical time series data also sequential event data were provided. Because of it's proclaimed applicability for modeling time series, a restricted Boltzmann machine (RBM) with Gaussian visible units has been used. As it is a generative model the trained RBM can estimate the probability of a given set of input variables, that are the sensor values at a specific point in time. In case the probability falls below a threshold the occurrence of an event is predicted. The second applied algorithm was a support vector machine (SVM), a binary linear classifier which is trained through supervised learning. To examine the performances of the methods for the respective time interval a receiver operating characteristic (ROC) was used. Thereby not only false event predictions but also the number of missed events could be investigated. The results substantiate the approach to be valuable for data analysis, although further improvements are possible. Furthermore, the derived predictions can be enhanced by decision prescriptions in the sense of prescriptive analytics. A link to entrepreneurial circumstances could enlarge practical benefits.

AB - Due to current structural change in energy systems (energy transition), the construction of wind turbines in order to provide a sustainable production of energy rose to prominence in the last couple of years. While investment in renewable energy has been supported by governments in one way or the other in the past, ceased subsidy may strongly influence profitability in the future. The costs of construction as well as for operations and maintenance (O&M) can be significant. Unscheduled maintenance, often caused by breakdown of the turbine, has been identified as a major part of overall (O&M) costs, owing to increased logistical expenditure and lost revenue. As a consequence condition monitoring systems (CMS) more and more have manifested themselves to measure behavior of the turbine or individual components in order to provide diagnostic information. This in turn facilitates the scheduling of repair work as well as allows the prediction and consecutive avoidance of component failures that may lead to a breakdown. CMS can have access to various data sources in form of time series, that include vibration, voltage levels or performance related information. Common CMS approaches and related research likewise are often focused on anomaly detection in time series by applying physical models and various transformations on the data (e.g. Wavelet transformation). The usage of machine learning algorithms for prediction lacks a more thorough investigation. The subject of this thesis is to contribute to the closure of this gap by applying two different machine learning algorithms to performance related data of a wind turbine in order to detect anomalies and predict events in defined future time intervals. Despite the technical nature of this specific case, we aim to present the applied approach as valuable procedure for any time series prediction as often occurring in the domain of predictive analytics. Besides physical time series data also sequential event data were provided. Because of it's proclaimed applicability for modeling time series, a restricted Boltzmann machine (RBM) with Gaussian visible units has been used. As it is a generative model the trained RBM can estimate the probability of a given set of input variables, that are the sensor values at a specific point in time. In case the probability falls below a threshold the occurrence of an event is predicted. The second applied algorithm was a support vector machine (SVM), a binary linear classifier which is trained through supervised learning. To examine the performances of the methods for the respective time interval a receiver operating characteristic (ROC) was used. Thereby not only false event predictions but also the number of missed events could be investigated. The results substantiate the approach to be valuable for data analysis, although further improvements are possible. Furthermore, the derived predictions can be enhanced by decision prescriptions in the sense of prescriptive analytics. A link to entrepreneurial circumstances could enlarge practical benefits.

KW - maschinelles Lernen

KW - Windkraftanlage

KW - Condition Monitoring System

KW - Support Vector Maschine

KW - Gaussian-Bernoulli-RBM

KW - Predictive Analytics

KW - machine learning

KW - wind turbine

KW - CMS

KW - SVM

KW - Gaussian-Bernoulli-RBM

KW - predictive analytics

M3 - Master's Thesis

ER -