Converting a text mining system into a UIMA framework

The UIMA Framework aids in discovering knowledge from unstructured information by coordinating Analysis Engines. Analysis Engines are therefore independent modules that are shielded from integration details and can focus on their own functions. A known Natural Language Processing...

Bibliographic Details
Main Author: Choo, Zhen Ying.
Other Authors: School of Computer Engineering
Format: Final Year Project
Language: English
Published: 2013
Subjects:
Online Access: http://hdl.handle.net/10356/55027
Institution: Nanyang Technological University
id sg-ntu-dr.10356-55027
record_format dspace
spelling sg-ntu-dr.10356-55027; 2023-03-03T20:30:03Z; Converting a text mining system into a UIMA framework; Choo, Zhen Ying.; School of Computer Engineering; Kim Jung-Jae; DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications; Bachelor of Engineering (Computer Science); 2013-12-04T01:12:47Z; 2013; Final Year Project (FYP); http://hdl.handle.net/10356/55027; en; Nanyang Technological University; 54 p.; application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications
description The UIMA Framework aids in discovering knowledge from unstructured information by coordinating Analysis Engines. Analysis Engines are therefore independent modules that are shielded from integration details and can focus on their own functions. U-Compare, a known Natural Language Processing system, also uses the UIMA Framework. In this project, parts of an existing Ontology-driven Software Engineering Environment (OSEE) pattern matching system, which infers implied events from direct events in biomedical text, are converted into the UIMA structure and deployed into the U-Compare system to gain the benefits of the UIMA Framework. These parts comprise a Named Entity Recognition (NER) module, the parsing of input sentences to obtain their semantic structures, and a Pattern Matching module that identifies the relations between the named entities. A separate parser that generates the dictionary file for Chemical terms was also modified and used. Converting these parts into the UIMA structure makes the OSEE system easier to distribute to a larger community. Future enhancements could automate the creation of Annotation types according to those listed in a user-specified settings file. A Graphical User Interface could also be developed to let users select the files required by the OSEE system, and the parallel processing of mutually independent tasks in the OSEE system could be explored further.
author2 School of Computer Engineering
format Final Year Project
author Choo, Zhen Ying.
title Converting a text mining system into a UIMA framework
publishDate 2013
url http://hdl.handle.net/10356/55027