Text mining of CRM data using decision tree induction and support vector machines.

In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line...

Full description

Saved in:
Bibliographic Details
Main Author: Cheung, Chee Wai.
Other Authors: Khoo Soo Guan, Christopher
Format: Theses and Dissertations
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/42237
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-42237
record_format dspace
spelling sg-ntu-dr.10356-422372019-12-10T12:07:31Z Text mining of CRM data using decision tree induction and support vector machines. Cheung, Chee Wai. Khoo Soo Guan, Christopher Wee Kim Wee School of Communication and Information DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line. This study seeks to develop an automatic method to categorize the call centre CRM data of a consumer electronics manufacturer. The study contributes to the existing literature by comparing and evaluating the text mining modeling techniques of Decision Tree Induction and Support Vector Machines as applied to call centre CRM data in free text form. In the sphere of feature selection, the impact of bigrams, extracted keywords and expert selected keywords are discussed. The Decision Tree Induction model produces prediction accuracy of 52.74% and the SVM model generated prediction accuracy of 60.4%. While the SVM model provides the higher prediction accuracy, the Decision Tree Induction model has demonstrated its usefulness in the interpretation and explanation of the results, especially in understanding the impact of influential keywords on specific target topics. The recommendation is to incorporate both the Decision Tree Induction and SVM models in the automated process of categorizing the call centre data. The inclusion of bigrams and expert selected keywords would have a positive impact towards the text mining process. The limitations of this study include the need to involve domain experts, overcome the spelling and grammatical errors in the dataset and the small number of cases for certain target topics. Master of Science (Information Systems) 2010-10-04T07:25:48Z 2010-10-04T07:25:48Z 2008 2008 Thesis http://hdl.handle.net/10356/42237 en Nanyang Technological University 190 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
spellingShingle DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
Cheung, Chee Wai.
Text mining of CRM data using decision tree induction and support vector machines.
description In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line. This study seeks to develop an automatic method to categorize the call centre CRM data of a consumer electronics manufacturer. The study contributes to the existing literature by comparing and evaluating the text mining modeling techniques of Decision Tree Induction and Support Vector Machines as applied to call centre CRM data in free text form. In the sphere of feature selection, the impact of bigrams, extracted keywords and expert selected keywords are discussed. The Decision Tree Induction model produces prediction accuracy of 52.74% and the SVM model generated prediction accuracy of 60.4%. While the SVM model provides the higher prediction accuracy, the Decision Tree Induction model has demonstrated its usefulness in the interpretation and explanation of the results, especially in understanding the impact of influential keywords on specific target topics. The recommendation is to incorporate both the Decision Tree Induction and SVM models in the automated process of categorizing the call centre data. The inclusion of bigrams and expert selected keywords would have a positive impact towards the text mining process. The limitations of this study include the need to involve domain experts, overcome the spelling and grammatical errors in the dataset and the small number of cases for certain target topics.
author2 Khoo Soo Guan, Christopher
author_facet Khoo Soo Guan, Christopher
Cheung, Chee Wai.
format Theses and Dissertations
author Cheung, Chee Wai.
author_sort Cheung, Chee Wai.
title Text mining of CRM data using decision tree induction and support vector machines.
title_short Text mining of CRM data using decision tree induction and support vector machines.
title_full Text mining of CRM data using decision tree induction and support vector machines.
title_fullStr Text mining of CRM data using decision tree induction and support vector machines.
title_full_unstemmed Text mining of CRM data using decision tree induction and support vector machines.
title_sort text mining of crm data using decision tree induction and support vector machines.
publishDate 2010
url http://hdl.handle.net/10356/42237
_version_ 1681035026341298176