Fraction-score : a new support measure for co-location pattern mining
Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/137975 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-137975 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1379752020-04-21T00:21:41Z Fraction-score : a new support measure for co-location pattern mining Koh, Wei Hao Cheng Long School of Computer Science and Engineering c.long@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure that appraise the popularity or prevalence of a pattern candidate or a label set. Some support measures exist to define the co location patterns and they share an idea of counting the number of instances of a given set C as i ts support, where an instance of C is an object whose objects carry all the labels in C and are located close to one another. However, these support measures suffer from various weaknesses, some will fail to notice all possible instances while the other support measures will ignore some cases when multiple instances overlap . In this FYP , a new support measure will be proposed which is called the Fraction Score. Its main idea is to count instances fractionally if they overlap. Compared to the existing support measures, Fraction Score not only capture all possible instances, but also handles the cases where instances overlap appropriately so that the supports defined are more meaningful and consistent with the desirable anti-monotonicity property . Efficient algorithms have been developed which are significantly faster than a baseline that adapts the state of art to solve the co location pattern mining problem based on Fraction Score. Extensive experiments have been conducted as well using both real and synthetic datasets, which verified the outstanding performance of the Fraction Score and the efficiency of the developed algorithms. Bachelor of Engineering (Computer Science) 2020-04-21T00:16:47Z 2020-04-21T00:16:47Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/137975 en SCSE19-0030 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Pattern recognition |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Koh, Wei Hao Fraction-score : a new support measure for co-location pattern mining |
description |
Co-location data mining are popular on spatial objects with categorical labels, this brings about the interesting pattern that objects with various labels are frequently located in close geographic proximity. Similar to numerous item sets, co location patterns are defined based on a support measure that appraise the popularity or prevalence of a pattern candidate or a label set. Some support measures exist to define the co location patterns and they share an idea of counting the number of instances of a given set C as i ts support, where an instance of C is an object whose objects carry all the labels in C and are located close to one another.
However, these support measures suffer from various weaknesses, some will fail to notice all
possible instances while the other support measures will ignore some cases when multiple
instances overlap . In this FYP , a new support measure will be proposed which is called the
Fraction Score. Its main idea is to count instances fractionally if they overlap. Compared to
the existing support measures, Fraction Score not only capture all possible instances, but also
handles the cases where instances overlap appropriately so that the supports defined are more
meaningful and consistent with the desirable anti-monotonicity property . Efficient algorithms
have been developed which are significantly faster than a baseline that adapts the state of art
to solve the co location pattern mining problem based on Fraction Score. Extensive experiments have been conducted as well using both real and synthetic datasets, which verified the outstanding performance of the Fraction Score and the efficiency of the developed algorithms. |
author2 |
Cheng Long |
author_facet |
Cheng Long Koh, Wei Hao |
format |
Final Year Project |
author |
Koh, Wei Hao |
author_sort |
Koh, Wei Hao |
title |
Fraction-score : a new support measure for co-location pattern mining |
title_short |
Fraction-score : a new support measure for co-location pattern mining |
title_full |
Fraction-score : a new support measure for co-location pattern mining |
title_fullStr |
Fraction-score : a new support measure for co-location pattern mining |
title_full_unstemmed |
Fraction-score : a new support measure for co-location pattern mining |
title_sort |
fraction-score : a new support measure for co-location pattern mining |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/137975 |
_version_ |
1681059129385287680 |