A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data

In data analysis, recognizing unusual patterns (outliers’ analysis or anomaly detection) plays a crucial role in identifying critical events. Because of its widespread use in many applications, it remains an important and extensive research brand in data mining. As a result, numerous techniques for...

Full description

Saved in:
Bibliographic Details
Main Authors: Suboh, Syahirah, Abdul Aziz, Izzatdin, Shaharudin, Shazlyn Milleana, Akmar Ismail, Saidatul, Mahdin, Hairulnizam
Format: Article
Language:English
Published: JOIV 2023
Subjects:
Online Access:http://eprints.uthm.edu.my/9805/1/J15862_f3944b7e279a07421e2ed97fc6d397d2.pdf
http://eprints.uthm.edu.my/9805/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Tun Hussein Onn Malaysia
Language: English
id my.uthm.eprints.9805
record_format eprints
spelling my.uthm.eprints.98052023-09-13T07:22:37Z http://eprints.uthm.edu.my/9805/ A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data Suboh, Syahirah Abdul Aziz, Izzatdin Shaharudin, Shazlyn Milleana Akmar Ismail, Saidatul Mahdin, Hairulnizam T Technology (General) In data analysis, recognizing unusual patterns (outliers’ analysis or anomaly detection) plays a crucial role in identifying critical events. Because of its widespread use in many applications, it remains an important and extensive research brand in data mining. As a result, numerous techniques for finding anomalies have been developed, and more are still being worked on. Researchers can gain vital knowledge by identifying anomalies, which helps them make better meaningful data analyses. However, anomaly detection is even more challenging when the datasets are high-dimensional and multivariate. In the literature, anomaly detection has received much attention but not as much as anomaly detection, specifically in high dimensional and multivariate conditions. This paper systematically reviews the existing related techniques and presents extensive coverage of challenges and perspectives of anomaly detection within highdimensional and multivariate data. At the same time, it provides a clear insight into the techniques developed for anomaly detection problems. This paper aims to help select the best technique that suits its rightful purpose. It has been found that PCA, DOBIN, Stray algorithm, and DAE-KNN have a high learning rate compared to Random projection, ROBEM, and OCP methods. Overall, most methods have shown an excellent ability to tackle the curse of dimensionality and multivariate features to perform anomaly detection. Moreover, a comparison of each algorithm for anomaly detection is also provided to produce a better algorithm. Finally, it would be a line of future studies to extend by comparing the methods on other domain-specific datasets and offering a comprehensive anomaly interpretation in describing the truth of anomalies. JOIV 2023 Article PeerReviewed text en http://eprints.uthm.edu.my/9805/1/J15862_f3944b7e279a07421e2ed97fc6d397d2.pdf Suboh, Syahirah and Abdul Aziz, Izzatdin and Shaharudin, Shazlyn Milleana and Akmar Ismail, Saidatul and Mahdin, Hairulnizam (2023) A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data. INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION, 7 (1). pp. 122-130.
institution Universiti Tun Hussein Onn Malaysia
building UTHM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tun Hussein Onn Malaysia
content_source UTHM Institutional Repository
url_provider http://eprints.uthm.edu.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Suboh, Syahirah
Abdul Aziz, Izzatdin
Shaharudin, Shazlyn Milleana
Akmar Ismail, Saidatul
Mahdin, Hairulnizam
A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data
description In data analysis, recognizing unusual patterns (outliers’ analysis or anomaly detection) plays a crucial role in identifying critical events. Because of its widespread use in many applications, it remains an important and extensive research brand in data mining. As a result, numerous techniques for finding anomalies have been developed, and more are still being worked on. Researchers can gain vital knowledge by identifying anomalies, which helps them make better meaningful data analyses. However, anomaly detection is even more challenging when the datasets are high-dimensional and multivariate. In the literature, anomaly detection has received much attention but not as much as anomaly detection, specifically in high dimensional and multivariate conditions. This paper systematically reviews the existing related techniques and presents extensive coverage of challenges and perspectives of anomaly detection within highdimensional and multivariate data. At the same time, it provides a clear insight into the techniques developed for anomaly detection problems. This paper aims to help select the best technique that suits its rightful purpose. It has been found that PCA, DOBIN, Stray algorithm, and DAE-KNN have a high learning rate compared to Random projection, ROBEM, and OCP methods. Overall, most methods have shown an excellent ability to tackle the curse of dimensionality and multivariate features to perform anomaly detection. Moreover, a comparison of each algorithm for anomaly detection is also provided to produce a better algorithm. Finally, it would be a line of future studies to extend by comparing the methods on other domain-specific datasets and offering a comprehensive anomaly interpretation in describing the truth of anomalies.
format Article
author Suboh, Syahirah
Abdul Aziz, Izzatdin
Shaharudin, Shazlyn Milleana
Akmar Ismail, Saidatul
Mahdin, Hairulnizam
author_facet Suboh, Syahirah
Abdul Aziz, Izzatdin
Shaharudin, Shazlyn Milleana
Akmar Ismail, Saidatul
Mahdin, Hairulnizam
author_sort Suboh, Syahirah
title A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data
title_short A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data
title_full A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data
title_fullStr A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data
title_full_unstemmed A Systematic Review of Anomaly Detection within High Dimensional and Multivariate Data
title_sort systematic review of anomaly detection within high dimensional and multivariate data
publisher JOIV
publishDate 2023
url http://eprints.uthm.edu.my/9805/1/J15862_f3944b7e279a07421e2ed97fc6d397d2.pdf
http://eprints.uthm.edu.my/9805/
_version_ 1778164198100959232