Semantic text analytic services on the cloud
Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation wor...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/54449 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-54449 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-544492023-07-07T17:18:04Z Semantic text analytic services on the cloud Ng, Wei Kok. Chan Chee Keong School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research Flora Tsai Rajaraman Kanagasabai DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation works on 10 machines, to scale up to 100s or even 1000s of machines would require major changes to the implementation. The Hadoop framework ensures reliability and availability using unreliable commodity hardware. Hadoop also has a linear scaling, even when scaling up by orders of magnitude, giving Hadoop an advantage over other forms of distributed computing. By implementing GATE, a widely used Text Analytic tool, on the Hadoop framework using MapReduce, this project aims to enable Text Analysis on Big Data, with the linear scaling provided by the Hadoop framework. Performance analysis of the implementation shows that there is indeed a linear scaling when processing with increasing number of machines. As GATE can be used for a multitude of Text Analytic purposes, this implementation will allow the analysis of Big Data in many areas of interest. Bachelor of Engineering 2013-06-20T06:52:54Z 2013-06-20T06:52:54Z 2013 2013 Final Year Project (FYP) http://hdl.handle.net/10356/54449 en Nanyang Technological University 69 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Ng, Wei Kok. Semantic text analytic services on the cloud |
description |
Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation works on 10 machines, to scale up to 100s or even 1000s of machines would require major changes to the implementation. The Hadoop framework ensures reliability and availability using unreliable commodity hardware. Hadoop also has a linear scaling, even when scaling up by orders of magnitude, giving Hadoop an advantage over other forms of distributed computing. By implementing GATE, a widely used Text Analytic tool, on the Hadoop framework using MapReduce, this project aims to enable Text Analysis on Big Data, with the linear scaling provided by the Hadoop framework. Performance analysis of the implementation shows that there is indeed a linear scaling when processing with increasing number of machines. As GATE can be used for a multitude of Text Analytic purposes, this implementation will allow the analysis of Big Data in many areas of interest. |
author2 |
Chan Chee Keong |
author_facet |
Chan Chee Keong Ng, Wei Kok. |
format |
Final Year Project |
author |
Ng, Wei Kok. |
author_sort |
Ng, Wei Kok. |
title |
Semantic text analytic services on the cloud |
title_short |
Semantic text analytic services on the cloud |
title_full |
Semantic text analytic services on the cloud |
title_fullStr |
Semantic text analytic services on the cloud |
title_full_unstemmed |
Semantic text analytic services on the cloud |
title_sort |
semantic text analytic services on the cloud |
publishDate |
2013 |
url |
http://hdl.handle.net/10356/54449 |
_version_ |
1772829072387211264 |