Outlier detection based on neighborhood proximity

Outliers, also called anomalies are data patterns that do not conform to the behavior that is expected or differ too much from the rest. In some cases, outliers could be caused by errors in data generating/collecting methods or by inherent data variability. However, in many situations, outliers are...

Full description

Saved in:

Bibliographic Details
Main Author:	Nguyen, Hoang Vu
Other Authors:	Vivekanand Gopalkrishnan
Format:	Theses and Dissertations
Language:	English
Published:	2010
Subjects:	DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications DRNTU::Engineering::Computer science and engineering::Information systems::Database management
Online Access:	https://hdl.handle.net/10356/42448
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-42448
record_format	dspace
spelling	sg-ntu-dr.10356-424482023-03-04T00:44:50Z Outlier detection based on neighborhood proximity Nguyen, Hoang Vu Vivekanand Gopalkrishnan School of Computer Engineering Centre for Advanced Information Systems DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications DRNTU::Engineering::Computer science and engineering::Information systems::Database management Outliers, also called anomalies are data patterns that do not conform to the behavior that is expected or differ too much from the rest. In some cases, outliers could be caused by errors in data generating/collecting methods or by inherent data variability. However, in many situations, outliers are indications of interesting events that have never been known before and hence, an adaptation of the theory to capture the new events is required to explore the underlying mechanisms. The two-side effect of outliers necessitates the development of efficient methods to detect them for either (a) eliminating/minimizing their impacts on general performance of information systems or (b) capturing the underlying interesting knowledge (e.g. intrusive connections in a network). In general, outlier detection has many practical applications, especially in domains that have scope for abnormal behavior, such as fraud detection, network intrusion detection, medical diagnosis, marketing, customer segmentation, etc. There are many ways in practice to solve our problem of interest. This thesis deals specifically with outlier notions based on measures of neighborhood dissimilarity. Related works can be divided into two main categories: distance-based and density-based. In our study, we place our focus more on distance-based approaches. With considerations to the limitations of existing works, we propose two techniques, tackling separate aspects of outlier detection. MASTER OF ENGINEERING (SCE) 2010-12-02T08:26:23Z 2010-12-02T08:26:23Z 2010 2010 Thesis Nguyen, H. V. (2010). Outlier detection based on neighborhood proximity.. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/42448 10.32657/10356/42448 en 90 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications DRNTU::Engineering::Computer science and engineering::Information systems::Database management
spellingShingle	DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications DRNTU::Engineering::Computer science and engineering::Information systems::Database management Nguyen, Hoang Vu Outlier detection based on neighborhood proximity
description	Outliers, also called anomalies are data patterns that do not conform to the behavior that is expected or differ too much from the rest. In some cases, outliers could be caused by errors in data generating/collecting methods or by inherent data variability. However, in many situations, outliers are indications of interesting events that have never been known before and hence, an adaptation of the theory to capture the new events is required to explore the underlying mechanisms. The two-side effect of outliers necessitates the development of efficient methods to detect them for either (a) eliminating/minimizing their impacts on general performance of information systems or (b) capturing the underlying interesting knowledge (e.g. intrusive connections in a network). In general, outlier detection has many practical applications, especially in domains that have scope for abnormal behavior, such as fraud detection, network intrusion detection, medical diagnosis, marketing, customer segmentation, etc. There are many ways in practice to solve our problem of interest. This thesis deals specifically with outlier notions based on measures of neighborhood dissimilarity. Related works can be divided into two main categories: distance-based and density-based. In our study, we place our focus more on distance-based approaches. With considerations to the limitations of existing works, we propose two techniques, tackling separate aspects of outlier detection.
author2	Vivekanand Gopalkrishnan
author_facet	Vivekanand Gopalkrishnan Nguyen, Hoang Vu
format	Theses and Dissertations
author	Nguyen, Hoang Vu
author_sort	Nguyen, Hoang Vu
title	Outlier detection based on neighborhood proximity
title_short	Outlier detection based on neighborhood proximity
title_full	Outlier detection based on neighborhood proximity
title_fullStr	Outlier detection based on neighborhood proximity
title_full_unstemmed	Outlier detection based on neighborhood proximity
title_sort	outlier detection based on neighborhood proximity
publishDate	2010
url	https://hdl.handle.net/10356/42448
_version_	1759855929410453504

Outlier detection based on neighborhood proximity

Similar Items