Text mining of CRM data using decision tree induction and support vector machines.

In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line...

Full description

Saved in:

Bibliographic Details
Main Author:	Cheung, Chee Wai.
Other Authors:	Khoo Soo Guan, Christopher
Format:	Theses and Dissertations
Language:	English
Published:	2010
Subjects:	DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
Online Access:	http://hdl.handle.net/10356/42237
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-42237
record_format	dspace
spelling	sg-ntu-dr.10356-422372019-12-10T12:07:31Z Text mining of CRM data using decision tree induction and support vector machines. Cheung, Chee Wai. Khoo Soo Guan, Christopher Wee Kim Wee School of Communication and Information DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line. This study seeks to develop an automatic method to categorize the call centre CRM data of a consumer electronics manufacturer. The study contributes to the existing literature by comparing and evaluating the text mining modeling techniques of Decision Tree Induction and Support Vector Machines as applied to call centre CRM data in free text form. In the sphere of feature selection, the impact of bigrams, extracted keywords and expert selected keywords are discussed. The Decision Tree Induction model produces prediction accuracy of 52.74% and the SVM model generated prediction accuracy of 60.4%. While the SVM model provides the higher prediction accuracy, the Decision Tree Induction model has demonstrated its usefulness in the interpretation and explanation of the results, especially in understanding the impact of influential keywords on specific target topics. The recommendation is to incorporate both the Decision Tree Induction and SVM models in the automated process of categorizing the call centre data. The inclusion of bigrams and expert selected keywords would have a positive impact towards the text mining process. The limitations of this study include the need to involve domain experts, overcome the spelling and grammatical errors in the dataset and the small number of cases for certain target topics. Master of Science (Information Systems) 2010-10-04T07:25:48Z 2010-10-04T07:25:48Z 2008 2008 Thesis http://hdl.handle.net/10356/42237 en Nanyang Technological University 190 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
spellingShingle	DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Cheung, Chee Wai. Text mining of CRM data using decision tree induction and support vector machines.
description	In today's business environment, the organization's call centre has become an important reservoir of customer intelligence. This intelligence can be tapped upon to develop better products and service approach to enhance customer satisfaction and improve the organization's bottom-line. This study seeks to develop an automatic method to categorize the call centre CRM data of a consumer electronics manufacturer. The study contributes to the existing literature by comparing and evaluating the text mining modeling techniques of Decision Tree Induction and Support Vector Machines as applied to call centre CRM data in free text form. In the sphere of feature selection, the impact of bigrams, extracted keywords and expert selected keywords are discussed. The Decision Tree Induction model produces prediction accuracy of 52.74% and the SVM model generated prediction accuracy of 60.4%. While the SVM model provides the higher prediction accuracy, the Decision Tree Induction model has demonstrated its usefulness in the interpretation and explanation of the results, especially in understanding the impact of influential keywords on specific target topics. The recommendation is to incorporate both the Decision Tree Induction and SVM models in the automated process of categorizing the call centre data. The inclusion of bigrams and expert selected keywords would have a positive impact towards the text mining process. The limitations of this study include the need to involve domain experts, overcome the spelling and grammatical errors in the dataset and the small number of cases for certain target topics.
author2	Khoo Soo Guan, Christopher
author_facet	Khoo Soo Guan, Christopher Cheung, Chee Wai.
format	Theses and Dissertations
author	Cheung, Chee Wai.
author_sort	Cheung, Chee Wai.
title	Text mining of CRM data using decision tree induction and support vector machines.
title_short	Text mining of CRM data using decision tree induction and support vector machines.
title_full	Text mining of CRM data using decision tree induction and support vector machines.
title_fullStr	Text mining of CRM data using decision tree induction and support vector machines.
title_full_unstemmed	Text mining of CRM data using decision tree induction and support vector machines.
title_sort	text mining of crm data using decision tree induction and support vector machines.
publishDate	2010
url	http://hdl.handle.net/10356/42237
_version_	1681035026341298176

Text mining of CRM data using decision tree induction and support vector machines.

Similar Items