Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
Web crawler has been regarded as one of the most effective ways in extracting large amount of data from websites. With information technology, human languages can be understood by natural language processing (NLP) programs to some extent. In this report, web crawling and natural language processi...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | Chinese |
Published: |
2014
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/55818 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | Chinese |
id |
sg-ntu-dr.10356-55818 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-558182023-05-19T05:44:56Z Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings Xu, Yingchun Yang, Guang Zou, Peijun Goh Kim Huat Nanyang Business School DRNTU::Business::Information technology Web crawler has been regarded as one of the most effective ways in extracting large amount of data from websites. With information technology, human languages can be understood by natural language processing (NLP) programs to some extent. In this report, web crawling and natural language processing technology were used to extract reviewer opinions from Tripadvisor webpages. We studied opinions towards 50 hotels located in Las Vegas, Untied States of America, and constructed a model to predict customer ratings in relation to their opinions, experience and hotel ranking. It has been found that reviewer ratings towards a certain hotel has a positive correlation with both reviewer opinions and reviewer experience, and has a negative correlation with hotel ranking. Future research directions include improvement on NLP’s accuracy and applications on other industries such as entertainment, consumer goods, etc. BUSINESS AND COMPUTING 2014-04-01T01:31:57Z 2014-04-01T01:31:57Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/55818 zh Nanyang Technological University 51 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
Chinese |
topic |
DRNTU::Business::Information technology |
spellingShingle |
DRNTU::Business::Information technology Xu, Yingchun Yang, Guang Zou, Peijun Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings |
description |
Web crawler has been regarded as one of the most effective ways in extracting large amount of data from websites. With information technology, human languages can be understood by natural language processing (NLP) programs to some extent.
In this report, web crawling and natural language processing technology were used to extract reviewer opinions from Tripadvisor webpages. We studied opinions towards 50 hotels located in Las Vegas, Untied States of America, and constructed a model to predict customer ratings in relation to their opinions, experience and hotel ranking. It has been found that reviewer ratings towards a certain hotel has a positive correlation with both reviewer opinions and reviewer experience, and has a negative correlation with hotel ranking.
Future research directions include improvement on NLP’s accuracy and applications on other industries such as entertainment, consumer goods, etc. |
author2 |
Goh Kim Huat |
author_facet |
Goh Kim Huat Xu, Yingchun Yang, Guang Zou, Peijun |
format |
Final Year Project |
author |
Xu, Yingchun Yang, Guang Zou, Peijun |
author_sort |
Xu, Yingchun |
title |
Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings |
title_short |
Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings |
title_full |
Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings |
title_fullStr |
Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings |
title_full_unstemmed |
Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings |
title_sort |
web crawler and nlp enabled data mining : a statistical study on the formation of hotel ratings |
publishDate |
2014 |
url |
http://hdl.handle.net/10356/55818 |
_version_ |
1770565052822192128 |