Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings

Web crawler has been regarded as one of the most effective ways in extracting large amount of data from websites. With information technology, human languages can be understood by natural language processing (NLP) programs to some extent. In this report, web crawling and natural language processi...

Full description

Saved in:
Bibliographic Details
Main Authors: Xu, Yingchun, Yang, Guang, Zou, Peijun
Other Authors: Goh Kim Huat
Format: Final Year Project
Language:Chinese
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/10356/55818
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: Chinese
id sg-ntu-dr.10356-55818
record_format dspace
spelling sg-ntu-dr.10356-558182023-05-19T05:44:56Z Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings Xu, Yingchun Yang, Guang Zou, Peijun Goh Kim Huat Nanyang Business School DRNTU::Business::Information technology Web crawler has been regarded as one of the most effective ways in extracting large amount of data from websites. With information technology, human languages can be understood by natural language processing (NLP) programs to some extent. In this report, web crawling and natural language processing technology were used to extract reviewer opinions from Tripadvisor webpages. We studied opinions towards 50 hotels located in Las Vegas, Untied States of America, and constructed a model to predict customer ratings in relation to their opinions, experience and hotel ranking. It has been found that reviewer ratings towards a certain hotel has a positive correlation with both reviewer opinions and reviewer experience, and has a negative correlation with hotel ranking. Future research directions include improvement on NLP’s accuracy and applications on other industries such as entertainment, consumer goods, etc. BUSINESS AND COMPUTING 2014-04-01T01:31:57Z 2014-04-01T01:31:57Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/55818 zh Nanyang Technological University 51 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language Chinese
topic DRNTU::Business::Information technology
spellingShingle DRNTU::Business::Information technology
Xu, Yingchun
Yang, Guang
Zou, Peijun
Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
description Web crawler has been regarded as one of the most effective ways in extracting large amount of data from websites. With information technology, human languages can be understood by natural language processing (NLP) programs to some extent. In this report, web crawling and natural language processing technology were used to extract reviewer opinions from Tripadvisor webpages. We studied opinions towards 50 hotels located in Las Vegas, Untied States of America, and constructed a model to predict customer ratings in relation to their opinions, experience and hotel ranking. It has been found that reviewer ratings towards a certain hotel has a positive correlation with both reviewer opinions and reviewer experience, and has a negative correlation with hotel ranking. Future research directions include improvement on NLP’s accuracy and applications on other industries such as entertainment, consumer goods, etc.
author2 Goh Kim Huat
author_facet Goh Kim Huat
Xu, Yingchun
Yang, Guang
Zou, Peijun
format Final Year Project
author Xu, Yingchun
Yang, Guang
Zou, Peijun
author_sort Xu, Yingchun
title Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
title_short Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
title_full Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
title_fullStr Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
title_full_unstemmed Web crawler and NLP enabled data mining : a statistical study on the formation of hotel ratings
title_sort web crawler and nlp enabled data mining : a statistical study on the formation of hotel ratings
publishDate 2014
url http://hdl.handle.net/10356/55818
_version_ 1770565052822192128