Text mining Wikipedia to discover alternative destinations

Bibliographic Details
Main Author: Kenneth Cosh
Format: Conference Proceeding
Published: 2018
Online Access: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84883394861&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/47645
Institution: Chiang Mai University
Description
Summary: This paper applies statistical Natural Language Processing algorithms to a set of Wikipedia articles about top tourist destinations. The objective is to automatically identify the key features of each destination and then discover other destinations that share similar sets of features. In doing so, it demonstrates a method by which metadata about each article can be extracted from the unstructured text and then used to answer complex discovery-type queries. The paper compares an approach that automatically clusters similar destinations with a more user-driven, feature-focused technique. © 2013 IEEE.
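
Illustrative sketch: this record does not say which algorithms the paper uses, so the example below substitutes TF-IDF term weighting and cosine similarity, two common statistical choices, to show the general idea of extracting key features per destination, ranking similar destinations, and answering a feature-focused query. The article texts, the `ARTICLES` dictionary, and all function names are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter

# Toy stand-ins for Wikipedia article text (hypothetical, not from the paper).
ARTICLES = {
    "Phuket": "beach island diving beach resort nightlife",
    "Bali": "beach island temple surfing resort",
    "Kyoto": "temple shrine garden history temple",
    "Rome": "history ruins museum church history",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return {name: {term: tf-idf weight}} for every document."""
    counts = {name: Counter(tokenize(text)) for name, text in docs.items()}
    n_docs = len(docs)
    # Document frequency: in how many articles each term appears.
    doc_freq = Counter(term for c in counts.values() for term in c)
    vectors = {}
    for name, c in counts.items():
        total = sum(c.values())
        vectors[name] = {
            term: (freq / total) * math.log(n_docs / doc_freq[term])
            for term, freq in c.items()
        }
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def key_features(vectors, name, k=3):
    """The k highest-weighted terms of one destination."""
    ranked = sorted(vectors[name].items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def similar_destinations(vectors, name):
    """Other destinations, ordered from most to least similar."""
    scores = {
        other: cosine(vectors[name], vec)
        for other, vec in vectors.items()
        if other != name
    }
    return sorted(scores, key=lambda other: (-scores[other], other))


def destinations_with(vectors, features):
    """Feature-focused query: destinations whose text mentions every feature."""
    return sorted(
        name for name, vec in vectors.items() if all(f in vec for f in features)
    )


vectors = tfidf_vectors(ARTICLES)
```

Under these assumptions, `similar_destinations(vectors, "Phuket")` ranks the other toy destinations by shared vocabulary, while `destinations_with(vectors, ["beach", "island"])` plays the role of a user-driven query over the extracted features.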