A statistical interestingness measures for XML based association rules

Recently mining frequent substructures from XML data has gained a considerable amount of interest. Different methods have been proposed and examined for mining frequent patterns from XML documents efficiently and effectively. While many frequent XML patterns generated are useful and interesting, it...

Full description

Saved in:

Bibliographic Details
Main Authors:	Mohd Shaharanee, Izwan Nizal, Hadzic, Fedja, Dillon, Tharam S.
Other Authors:	Zhang, B.-T.
Format:	Book Section
Language:	English
Published:	Springer-Verlag Berlin Heidelberg 2010
Subjects:	Q Science (General)
Online Access:	http://repo.uum.edu.my/1514/1/En._Izwan_Nizal_Mohd_Shaharanee%5B1%5D.pdf http://repo.uum.edu.my/1514/ http://www.springerlink.com/content/978-3-642-15245-0#section=757498&page=1&locus=48
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Utara Malaysia
Language:	English

id	my.uum.repo.1514
record_format	eprints
spelling	my.uum.repo.15142016-12-05T08:52:38Z http://repo.uum.edu.my/1514/ A statistical interestingness measures for XML based association rules Mohd Shaharanee, Izwan Nizal Hadzic, Fedja Dillon, Tharam S. Q Science (General) Recently mining frequent substructures from XML data has gained a considerable amount of interest. Different methods have been proposed and examined for mining frequent patterns from XML documents efficiently and effectively. While many frequent XML patterns generated are useful and interesting, it is common that a large portion of them is not considered as interesting or significant for the application at hand. In this paper, we present a systematic approach to ascertain whether the discovered XML patterns are significant and not just coincidental associations, and provide a precise statistical approach to support this framework. The proposed strategy combines data mining and statistical measurement techniques to discard the non significant patterns. In this paper we considered the "Prions" database that describes the protein instances stored for Human Prions Protein. The proposed unified framework is applied on this dataset to demonstrate its effectiveness in assessing interestingness of discovered XML patterns by statistical means.When the dataset is used for classification/prediction purposes, the proposed approach will discard non significant XML patterns, without the cost of a reduction in the accuracy of the pattern set as a whole. Springer-Verlag Berlin Heidelberg Zhang, B.-T. Orgun, M.A. 2010 Book Section PeerReviewed application/pdf en http://repo.uum.edu.my/1514/1/En._Izwan_Nizal_Mohd_Shaharanee%5B1%5D.pdf Mohd Shaharanee, Izwan Nizal and Hadzic, Fedja and Dillon, Tharam S. (2010) A statistical interestingness measures for XML based association rules. In: PRICAI 2010: Trends in Artificial Intelligence: 11th Pacific Rim International Conference on Artificial Intelligence, Daegu, Korea, August 30-September 2, 2010. Proceedings. Springer-Verlag Berlin Heidelberg, Berlin, pp. 194-205. ISBN 9783642152450 http://www.springerlink.com/content/978-3-642-15245-0#section=757498&page=1&locus=48
institution	Universiti Utara Malaysia
building	UUM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Utara Malaysia
content_source	UUM Institutionali Repository
url_provider	http://repo.uum.edu.my/
language	English
topic	Q Science (General)
spellingShingle	Q Science (General) Mohd Shaharanee, Izwan Nizal Hadzic, Fedja Dillon, Tharam S. A statistical interestingness measures for XML based association rules
description	Recently mining frequent substructures from XML data has gained a considerable amount of interest. Different methods have been proposed and examined for mining frequent patterns from XML documents efficiently and effectively. While many frequent XML patterns generated are useful and interesting, it is common that a large portion of them is not considered as interesting or significant for the application at hand. In this paper, we present a systematic approach to ascertain whether the discovered XML patterns are significant and not just coincidental associations, and provide a precise statistical approach to support this framework. The proposed strategy combines data mining and statistical measurement techniques to discard the non significant patterns. In this paper we considered the "Prions" database that describes the protein instances stored for Human Prions Protein. The proposed unified framework is applied on this dataset to demonstrate its effectiveness in assessing interestingness of discovered XML patterns by statistical means.When the dataset is used for classification/prediction purposes, the proposed approach will discard non significant XML patterns, without the cost of a reduction in the accuracy of the pattern set as a whole.
author2	Zhang, B.-T.
author_facet	Zhang, B.-T. Mohd Shaharanee, Izwan Nizal Hadzic, Fedja Dillon, Tharam S.
format	Book Section
author	Mohd Shaharanee, Izwan Nizal Hadzic, Fedja Dillon, Tharam S.
author_sort	Mohd Shaharanee, Izwan Nizal
title	A statistical interestingness measures for XML based association rules
title_short	A statistical interestingness measures for XML based association rules
title_full	A statistical interestingness measures for XML based association rules
title_fullStr	A statistical interestingness measures for XML based association rules
title_full_unstemmed	A statistical interestingness measures for XML based association rules
title_sort	statistical interestingness measures for xml based association rules
publisher	Springer-Verlag Berlin Heidelberg
publishDate	2010
url	http://repo.uum.edu.my/1514/1/En._Izwan_Nizal_Mohd_Shaharanee%5B1%5D.pdf http://repo.uum.edu.my/1514/ http://www.springerlink.com/content/978-3-642-15245-0#section=757498&page=1&locus=48
_version_	1644277995281580032

A statistical interestingness measures for XML based association rules

Similar Items