Text mining of CRM data using decision tree induction and support vector machines.
In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/42237 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-42237 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-422372019-12-10T12:07:31Z Text mining of CRM data using decision tree induction and support vector machines. Cheung, Chee Wai. Khoo Soo Guan, Christopher Wee Kim Wee School of Communication and Information DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line. This study seeks to develop an automatic method to categorize the call centre CRM data of a consumer electronics manufacturer. The study contributes to the existing literature by comparing and evaluating the text mining modeling techniques of Decision Tree Induction and Support Vector Machines as applied to call centre CRM data in free text form. In the sphere of feature selection, the impact of bigrams, extracted keywords and expert selected keywords are discussed. The Decision Tree Induction model produces prediction accuracy of 52.74% and the SVM model generated prediction accuracy of 60.4%. While the SVM model provides the higher prediction accuracy, the Decision Tree Induction model has demonstrated its usefulness in the interpretation and explanation of the results, especially in understanding the impact of influential keywords on specific target topics. The recommendation is to incorporate both the Decision Tree Induction and SVM models in the automated process of categorizing the call centre data. The inclusion of bigrams and expert selected keywords would have a positive impact towards the text mining process. The limitations of this study include the need to involve domain experts, overcome the spelling and grammatical errors in the dataset and the small number of cases for certain target topics. Master of Science (Information Systems) 2010-10-04T07:25:48Z 2010-10-04T07:25:48Z 2008 2008 Thesis http://hdl.handle.net/10356/42237 en Nanyang Technological University 190 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Cheung, Chee Wai. Text mining of CRM data using decision tree induction and support vector machines. |
description |
In today's business environment, the organization's call centre has become an
important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line. This study seeks to develop an automatic method to categorize the call centre CRM data of a consumer electronics
manufacturer. The study contributes to the existing literature by comparing and
evaluating the text mining modeling techniques of Decision Tree Induction and
Support Vector Machines as applied to call centre CRM data in free text form. In the
sphere of feature selection, the impact of bigrams, extracted keywords and expert
selected keywords are discussed.
The Decision Tree Induction model produces prediction accuracy of 52.74% and the
SVM model generated prediction accuracy of 60.4%. While the SVM model provides
the higher prediction accuracy, the Decision Tree Induction model has demonstrated
its usefulness in the interpretation and explanation of the results, especially in
understanding the impact of influential keywords on specific target topics.
The recommendation is to incorporate both the Decision Tree Induction and SVM
models in the automated process of categorizing the call centre data. The inclusion of bigrams and expert selected keywords would have a positive impact towards the text mining process. The limitations of this study include the need to involve domain experts, overcome the spelling and grammatical errors in the dataset and the small number of cases for certain target topics. |
author2 |
Khoo Soo Guan, Christopher |
author_facet |
Khoo Soo Guan, Christopher Cheung, Chee Wai. |
format |
Theses and Dissertations |
author |
Cheung, Chee Wai. |
author_sort |
Cheung, Chee Wai. |
title |
Text mining of CRM data using decision tree induction and support vector machines. |
title_short |
Text mining of CRM data using decision tree induction and support vector machines. |
title_full |
Text mining of CRM data using decision tree induction and support vector machines. |
title_fullStr |
Text mining of CRM data using decision tree induction and support vector machines. |
title_full_unstemmed |
Text mining of CRM data using decision tree induction and support vector machines. |
title_sort |
text mining of crm data using decision tree induction and support vector machines. |
publishDate |
2010 |
url |
http://hdl.handle.net/10356/42237 |
_version_ |
1681035026341298176 |