A heuristic method for correlating attribute group pairs in data mining

Many different kinds of algorithms have been developed to discover relationships between two attribute groups (e.g., association rule discovery algorithms, functional dependency discovery algorithms, and correlation tests). Of these algorithms, only the correlation tests discover relationships using...

Full description

Saved in:
Bibliographic Details
Main Authors: LIM, Ee Peng, CHIANG, Roger Hsiang-Li, CHUA, Cecil
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 1998
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/974
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1973
record_format dspace
spelling sg-smu-ink.sis_research-19732018-06-22T02:04:45Z A heuristic method for correlating attribute group pairs in data mining LIM, Ee Peng CHIANG, Roger Hsiang-Li CHUA, Cecil Many different kinds of algorithms have been developed to discover relationships between two attribute groups (e.g., association rule discovery algorithms, functional dependency discovery algorithms, and correlation tests). Of these algorithms, only the correlation tests discover relationships using the measurement scales of attribute groups. Measurement scales determine whether order or distance information should be considered in the relationship discovery process. Order and distance information limits the possible forms a legitimate relationship between two attribute groups can have. Since this information is considered in correlation tests, the relationships discovered tend not to be spurious. Furthermore, the result of a correlation test can be empirically evaluated by measuring its significance. Often, the appropriate correlation test to apply on an attribute group pair must be selected manually, as information required to identify the appropriate test (e.g., the measurement scale of the attribute groups) is not available in the database. However, information required for test identification can be inferred from the system catalog, and analysis of the values of the attribute groups. In this paper, we propose a (semi-) automated correlation test identification method which infers information for identifying appropriate tests, and measures the correlation between attribute group pairs. 1998-11-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/974 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
LIM, Ee Peng
CHIANG, Roger Hsiang-Li
CHUA, Cecil
A heuristic method for correlating attribute group pairs in data mining
description Many different kinds of algorithms have been developed to discover relationships between two attribute groups (e.g., association rule discovery algorithms, functional dependency discovery algorithms, and correlation tests). Of these algorithms, only the correlation tests discover relationships using the measurement scales of attribute groups. Measurement scales determine whether order or distance information should be considered in the relationship discovery process. Order and distance information limits the possible forms a legitimate relationship between two attribute groups can have. Since this information is considered in correlation tests, the relationships discovered tend not to be spurious. Furthermore, the result of a correlation test can be empirically evaluated by measuring its significance. Often, the appropriate correlation test to apply on an attribute group pair must be selected manually, as information required to identify the appropriate test (e.g., the measurement scale of the attribute groups) is not available in the database. However, information required for test identification can be inferred from the system catalog, and analysis of the values of the attribute groups. In this paper, we propose a (semi-) automated correlation test identification method which infers information for identifying appropriate tests, and measures the correlation between attribute group pairs.
format text
author LIM, Ee Peng
CHIANG, Roger Hsiang-Li
CHUA, Cecil
author_facet LIM, Ee Peng
CHIANG, Roger Hsiang-Li
CHUA, Cecil
author_sort LIM, Ee Peng
title A heuristic method for correlating attribute group pairs in data mining
title_short A heuristic method for correlating attribute group pairs in data mining
title_full A heuristic method for correlating attribute group pairs in data mining
title_fullStr A heuristic method for correlating attribute group pairs in data mining
title_full_unstemmed A heuristic method for correlating attribute group pairs in data mining
title_sort heuristic method for correlating attribute group pairs in data mining
publisher Institutional Knowledge at Singapore Management University
publishDate 1998
url https://ink.library.smu.edu.sg/sis_research/974
_version_ 1770570809792790528