Approximation algorithms for mining patterns from data streams

Traditional data mining techniques expect all data to be managed within some form of persistent datasets. Recently, for many emerging applications, such as stock tickers, web-click streams, and telecom call records, the concept of a \textit{data stream} is more appropriate than a stored dataset....

Full description

Saved in:

Bibliographic Details
Main Author:	Dang, Xuan Hong
Other Authors:	Ng, Wee Keong
Format:	Theses and Dissertations
Language:	English
Published:	2008
Subjects:	DRNTU::Engineering::Computer science and engineering::Information systems::Database management
Online Access:	https://hdl.handle.net/10356/13581
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-13581
record_format	dspace
spelling	sg-ntu-dr.10356-135812023-03-04T00:39:47Z Approximation algorithms for mining patterns from data streams Dang, Xuan Hong Ng, Wee Keong School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Information systems::Database management Traditional data mining techniques expect all data to be managed within some form of persistent datasets. Recently, for many emerging applications, such as stock tickers, web-click streams, and telecom call records, the concept of a \textit{data stream} is more appropriate than a stored dataset. Naturally, a data stream is generated continuously in a dynamic environment with huge volume, infinite flow, and fast changing behaviors. Furthermore, they usually arrive to a mining system in a push-based manner meanwhile system resources used in the mining process are generally restricted in advance. Consequently, there have been increasing demands for developing novel techniques that are able to discover interesting patterns from data streams while they work within system resource constraints. Moreover, the mining results returned by these techniques are often desirable to be guaranteed within some error. When such an important task is completed, it is strongly believed that the quality of making decisions can be improved significantly in streaming environments. This research aims to study and investigate various approximation algorithms in order to effectively and efficiently mine useful patterns from data streams under different system resource constraints. Two fundamental data mining tasks are explored in the streaming data context: frequent pattern discovering and cluster analysis. The contributions of this research are claimed as follows: A novel algorithm named EStream is developed to address the problem of online mining frequent patterns from data streams with precise error guarantee. DOCTOR OF PHILOSOPHY (SCE) 2008-10-20T09:57:16Z 2008-10-20T09:57:16Z 2008 2008 Thesis Dang, X. H. (2008). Approximation algorithms for mining patterns from data streams. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/13581 10.32657/10356/13581 en 189 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Information systems::Database management
spellingShingle	DRNTU::Engineering::Computer science and engineering::Information systems::Database management Dang, Xuan Hong Approximation algorithms for mining patterns from data streams
description	Traditional data mining techniques expect all data to be managed within some form of persistent datasets. Recently, for many emerging applications, such as stock tickers, web-click streams, and telecom call records, the concept of a \textit{data stream} is more appropriate than a stored dataset. Naturally, a data stream is generated continuously in a dynamic environment with huge volume, infinite flow, and fast changing behaviors. Furthermore, they usually arrive to a mining system in a push-based manner meanwhile system resources used in the mining process are generally restricted in advance. Consequently, there have been increasing demands for developing novel techniques that are able to discover interesting patterns from data streams while they work within system resource constraints. Moreover, the mining results returned by these techniques are often desirable to be guaranteed within some error. When such an important task is completed, it is strongly believed that the quality of making decisions can be improved significantly in streaming environments. This research aims to study and investigate various approximation algorithms in order to effectively and efficiently mine useful patterns from data streams under different system resource constraints. Two fundamental data mining tasks are explored in the streaming data context: frequent pattern discovering and cluster analysis. The contributions of this research are claimed as follows: A novel algorithm named EStream is developed to address the problem of online mining frequent patterns from data streams with precise error guarantee.
author2	Ng, Wee Keong
author_facet	Ng, Wee Keong Dang, Xuan Hong
format	Theses and Dissertations
author	Dang, Xuan Hong
author_sort	Dang, Xuan Hong
title	Approximation algorithms for mining patterns from data streams
title_short	Approximation algorithms for mining patterns from data streams
title_full	Approximation algorithms for mining patterns from data streams
title_fullStr	Approximation algorithms for mining patterns from data streams
title_full_unstemmed	Approximation algorithms for mining patterns from data streams
title_sort	approximation algorithms for mining patterns from data streams
publishDate	2008
url	https://hdl.handle.net/10356/13581
_version_	1759857302039429120

Approximation algorithms for mining patterns from data streams

Similar Items