Time series anomaly detection
Anomaly detection on time series data can be applied to many domains. It can be applied to machinery prognostics and health management (PHM) which is crucial to ensure a system’s reliability, increase operational safety and reduce maintenance cost. In this paper, anomaly detection is done on a...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/175134 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-175134 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1751342024-04-26T15:43:17Z Time series anomaly detection Lek, Jie Kai Kwoh Chee Keong School of Computer Science and Engineering ASCKKWOH@ntu.edu.sg Computer and Information Science Anomaly detection on time series data can be applied to many domains. It can be applied to machinery prognostics and health management (PHM) which is crucial to ensure a system’s reliability, increase operational safety and reduce maintenance cost. In this paper, anomaly detection is done on a time series aero-engine dataset by implementing an autoencoder. The time series dataset consists of multiple sensor readings of each engine and the Remaining Useful Life (RUL) of each engine at each point in time. The model is trained on a subset of the dataset where the RUL label is 130 (i.e. the maximum RUL of each engine), which represents the normal operating conditions of the engines, before the onset of degradation. The model is trained to minimize the difference between its original input and its reconstruction, quantified using the mean squared error. After training, the model is applied to the test dataset and the reconstruction error is calculated for each engine in the dataset. Anomalies are identified as points where reconstruction errors fall above the Interquartile Range and the first index of these anomalies will be interpreted as the point in time when anomalous behavior becomes prevalent, signaling the onset of degradation. In addition, after identifying the onset of degradation, a step is further taken to predict the RUL of each engine by implementing a deep CNN model. However, the degradation pattern varies across different entities and engines across its RUL and the engine’s RUL at a certain time step cannot be treated as a deterministic time value in most cases by simply setting the exact RUL labels as the labels for the data samples. Therefore, this project further explores the implementation of cycle-consistent learning, where it learns a new data representation subspace such that the degradation data of machines in similar health conditions can be well aligned across different entities to take into account the variation in degradation characteristics across different entities or engines. The model is trained and evaluated on the test dataset to predict the RUL of each engine. Performance of the autoencoder model is evaluated based on its predicted onset of degradation as compared to the actual onset of degradation. In addition, the performance of the CNN model with Cycle-consistent Learning is evaluated based on the RMSE. Lastly, the proposed model is compared against a deep CNN model as the baseline model where it is used to predict the RUL of the engines and the respective RMSE is obtained. The RMSE of the proposed model is lower than the RMSE of the baseline model. Bachelor's degree 2024-04-22T03:51:12Z 2024-04-22T03:51:12Z 2024 Final Year Project (FYP) Lek, J. K. (2024). Time series anomaly detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175134 https://hdl.handle.net/10356/175134 en SCSE23-0699 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science |
spellingShingle |
Computer and Information Science Lek, Jie Kai Time series anomaly detection |
description |
Anomaly detection on time series data can be applied to many domains. It can be applied to
machinery prognostics and health management (PHM) which is crucial to ensure a system’s
reliability, increase operational safety and reduce maintenance cost.
In this paper, anomaly detection is done on a time series aero-engine dataset by implementing an
autoencoder. The time series dataset consists of multiple sensor readings of each engine and the
Remaining Useful Life (RUL) of each engine at each point in time. The model is trained on a
subset of the dataset where the RUL label is 130 (i.e. the maximum RUL of each engine), which
represents the normal operating conditions of the engines, before the onset of degradation. The
model is trained to minimize the difference between its original input and its reconstruction,
quantified using the mean squared error. After training, the model is applied to the test dataset
and the reconstruction error is calculated for each engine in the dataset. Anomalies are identified
as points where reconstruction errors fall above the Interquartile Range and the first index of
these anomalies will be interpreted as the point in time when anomalous behavior becomes
prevalent, signaling the onset of degradation.
In addition, after identifying the onset of degradation, a step is further taken to predict the RUL
of each engine by implementing a deep CNN model.
However, the degradation pattern varies across different entities and engines across its RUL and
the engine’s RUL at a certain time step cannot be treated as a deterministic time value in most
cases by simply setting the exact RUL labels as the labels for the data samples. Therefore, this
project further explores the implementation of cycle-consistent learning, where it learns a new
data representation subspace such that the degradation data of machines in similar health
conditions can be well aligned across different entities to take into account the variation in
degradation characteristics across different entities or engines.
The model is trained and evaluated on the test dataset to predict the RUL of each engine.
Performance of the autoencoder model is evaluated based on its predicted onset of degradation as
compared to the actual onset of degradation. In addition, the performance of the CNN model
with Cycle-consistent Learning is evaluated based on the RMSE.
Lastly, the proposed model is compared against a deep CNN model as the baseline model where
it is used to predict the RUL of the engines and the respective RMSE is obtained. The RMSE of
the proposed model is lower than the RMSE of the baseline model. |
author2 |
Kwoh Chee Keong |
author_facet |
Kwoh Chee Keong Lek, Jie Kai |
format |
Final Year Project |
author |
Lek, Jie Kai |
author_sort |
Lek, Jie Kai |
title |
Time series anomaly detection |
title_short |
Time series anomaly detection |
title_full |
Time series anomaly detection |
title_fullStr |
Time series anomaly detection |
title_full_unstemmed |
Time series anomaly detection |
title_sort |
time series anomaly detection |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/175134 |
_version_ |
1814047245164609536 |