Semantic text analytic services on the cloud

Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation wor...

Full description

Saved in:
Bibliographic Details
Main Author: Ng, Wei Kok.
Other Authors: Chan Chee Keong
Format: Final Year Project
Language:English
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/10356/54449
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-54449
record_format dspace
spelling sg-ntu-dr.10356-544492023-07-07T17:18:04Z Semantic text analytic services on the cloud Ng, Wei Kok. Chan Chee Keong School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research Flora Tsai Rajaraman Kanagasabai DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation works on 10 machines, to scale up to 100s or even 1000s of machines would require major changes to the implementation. The Hadoop framework ensures reliability and availability using unreliable commodity hardware. Hadoop also has a linear scaling, even when scaling up by orders of magnitude, giving Hadoop an advantage over other forms of distributed computing. By implementing GATE, a widely used Text Analytic tool, on the Hadoop framework using MapReduce, this project aims to enable Text Analysis on Big Data, with the linear scaling provided by the Hadoop framework. Performance analysis of the implementation shows that there is indeed a linear scaling when processing with increasing number of machines. As GATE can be used for a multitude of Text Analytic purposes, this implementation will allow the analysis of Big Data in many areas of interest. Bachelor of Engineering 2013-06-20T06:52:54Z 2013-06-20T06:52:54Z 2013 2013 Final Year Project (FYP) http://hdl.handle.net/10356/54449 en Nanyang Technological University 69 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Ng, Wei Kok.
Semantic text analytic services on the cloud
description Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation works on 10 machines, to scale up to 100s or even 1000s of machines would require major changes to the implementation. The Hadoop framework ensures reliability and availability using unreliable commodity hardware. Hadoop also has a linear scaling, even when scaling up by orders of magnitude, giving Hadoop an advantage over other forms of distributed computing. By implementing GATE, a widely used Text Analytic tool, on the Hadoop framework using MapReduce, this project aims to enable Text Analysis on Big Data, with the linear scaling provided by the Hadoop framework. Performance analysis of the implementation shows that there is indeed a linear scaling when processing with increasing number of machines. As GATE can be used for a multitude of Text Analytic purposes, this implementation will allow the analysis of Big Data in many areas of interest.
author2 Chan Chee Keong
author_facet Chan Chee Keong
Ng, Wei Kok.
format Final Year Project
author Ng, Wei Kok.
author_sort Ng, Wei Kok.
title Semantic text analytic services on the cloud
title_short Semantic text analytic services on the cloud
title_full Semantic text analytic services on the cloud
title_fullStr Semantic text analytic services on the cloud
title_full_unstemmed Semantic text analytic services on the cloud
title_sort semantic text analytic services on the cloud
publishDate 2013
url http://hdl.handle.net/10356/54449
_version_ 1772829072387211264