Personalized classification for keyword-based category profiles

Personalized classification refers to allowing users to define their own categories and automating the assignment of documents to these categories. In this paper, we examine the use of keywords to define personalized categories and propose the use of Support Vector Machine (SVM) to perform personali...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Aixin, LIM, Ee Peng, NG, Wee-Keong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2002
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/980
https://ink.library.smu.edu.sg/context/sis_research/article/1979/viewcontent/7b04f5e1715c4617b221629aefda938acfc2.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1979
record_format dspace
spelling sg-smu-ink.sis_research-19792018-06-20T05:35:18Z Personalized classification for keyword-based category profiles SUN, Aixin LIM, Ee Peng NG, Wee-Keong Personalized classification refers to allowing users to define their own categories and automating the assignment of documents to these categories. In this paper, we examine the use of keywords to define personalized categories and propose the use of Support Vector Machine (SVM) to perform personalized classification. Two scenarios have been investigated. The first assumes that the personalized categories are defined in a flat category space. The second assumes that each personalized category is defined within a pre-defined general category that provides a more specific context for the personalized category. The training documents for personalized categories are obtained from a training document pool using a search engine and a set of keywords. Our experiments have delivered better classification results using the second scenario. We also conclude that the number of keywords used can be very small and increasing them does not always lead to better classification performance. 2002-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/980 info:doi/10.1007/3-540-45747-X_5 https://ink.library.smu.edu.sg/context/sis_research/article/1979/viewcontent/7b04f5e1715c4617b221629aefda938acfc2.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
SUN, Aixin
LIM, Ee Peng
NG, Wee-Keong
Personalized classification for keyword-based category profiles
description Personalized classification refers to allowing users to define their own categories and automating the assignment of documents to these categories. In this paper, we examine the use of keywords to define personalized categories and propose the use of Support Vector Machine (SVM) to perform personalized classification. Two scenarios have been investigated. The first assumes that the personalized categories are defined in a flat category space. The second assumes that each personalized category is defined within a pre-defined general category that provides a more specific context for the personalized category. The training documents for personalized categories are obtained from a training document pool using a search engine and a set of keywords. Our experiments have delivered better classification results using the second scenario. We also conclude that the number of keywords used can be very small and increasing them does not always lead to better classification performance.
format text
author SUN, Aixin
LIM, Ee Peng
NG, Wee-Keong
author_facet SUN, Aixin
LIM, Ee Peng
NG, Wee-Keong
author_sort SUN, Aixin
title Personalized classification for keyword-based category profiles
title_short Personalized classification for keyword-based category profiles
title_full Personalized classification for keyword-based category profiles
title_fullStr Personalized classification for keyword-based category profiles
title_full_unstemmed Personalized classification for keyword-based category profiles
title_sort personalized classification for keyword-based category profiles
publisher Institutional Knowledge at Singapore Management University
publishDate 2002
url https://ink.library.smu.edu.sg/sis_research/980
https://ink.library.smu.edu.sg/context/sis_research/article/1979/viewcontent/7b04f5e1715c4617b221629aefda938acfc2.pdf
_version_ 1770570813040230400