Fraction-score : a new support measure for co-location pattern mining

Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure...

Full description

Saved in:
Bibliographic Details
Main Author: Koh, Wei Hao
Other Authors: Cheng Long
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/137975
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-137975
record_format dspace
spelling sg-ntu-dr.10356-1379752020-04-21T00:21:41Z Fraction-score : a new support measure for co-location pattern mining Koh, Wei Hao Cheng Long School of Computer Science and Engineering c.long@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure that appraise the popularity or prevalence of a pattern candidate or a label set. Some support measures exist to define the co location patterns and they share an idea of counting the number of instances of a given set C as i ts support, where an instance of C is an object whose objects carry all the labels in C and are located close to one another. However, these support measures suffer from various weaknesses, some will fail to notice all possible instances while the other support measures will ignore some cases when multiple instances overlap . In this FYP , a new support measure will be proposed which is called the Fraction Score. Its main idea is to count instances fractionally if they overlap. Compared to the existing support measures, Fraction Score not only capture all possible instances, but also handles the cases where instances overlap appropriately so that the supports defined are more meaningful and consistent with the desirable anti-monotonicity property . Efficient algorithms have been developed which are significantly faster than a baseline that adapts the state of art to solve the co location pattern mining problem based on Fraction Score. Extensive experiments have been conducted as well using both real and synthetic datasets, which verified the outstanding performance of the Fraction Score and the efficiency of the developed algorithms. Bachelor of Engineering (Computer Science) 2020-04-21T00:16:47Z 2020-04-21T00:16:47Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/137975 en SCSE19-0030 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Koh, Wei Hao
Fraction-score : a new support measure for co-location pattern mining
description Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure that appraise the popularity or prevalence of a pattern candidate or a label set. Some support measures exist to define the co location patterns and they share an idea of counting the number of instances of a given set C as i ts support, where an instance of C is an object whose objects carry all the labels in C and are located close to one another. However, these support measures suffer from various weaknesses, some will fail to notice all possible instances while the other support measures will ignore some cases when multiple instances overlap . In this FYP , a new support measure will be proposed which is called the Fraction Score. Its main idea is to count instances fractionally if they overlap. Compared to the existing support measures, Fraction Score not only capture all possible instances, but also handles the cases where instances overlap appropriately so that the supports defined are more meaningful and consistent with the desirable anti-monotonicity property . Efficient algorithms have been developed which are significantly faster than a baseline that adapts the state of art to solve the co location pattern mining problem based on Fraction Score. Extensive experiments have been conducted as well using both real and synthetic datasets, which verified the outstanding performance of the Fraction Score and the efficiency of the developed algorithms.
author2 Cheng Long
author_facet Cheng Long
Koh, Wei Hao
format Final Year Project
author Koh, Wei Hao
author_sort Koh, Wei Hao
title Fraction-score : a new support measure for co-location pattern mining
title_short Fraction-score : a new support measure for co-location pattern mining
title_full Fraction-score : a new support measure for co-location pattern mining
title_fullStr Fraction-score : a new support measure for co-location pattern mining
title_full_unstemmed Fraction-score : a new support measure for co-location pattern mining
title_sort fraction-score : a new support measure for co-location pattern mining
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/137975
_version_ 1681059129385287680