Time expression extraction from free text

Today, extracting Time Expressions from free text remains a hot topic in Information Retrieval and Natural Processing Language Applications such as question answering systems. Even messaging applications such as WhatsApp are currently extracting time expressions to allow its users to add the dates i...

Full description

Saved in:

Bibliographic Details
Main Author:	Chua, Chin Aik
Other Authors:	Sun Aixin
Format:	Final Year Project
Language:	English
Published:	2018
Subjects:	DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
Online Access:	http://hdl.handle.net/10356/74079
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-74079
record_format	dspace
spelling	sg-ntu-dr.10356-740792023-03-03T20:27:34Z Time expression extraction from free text Chua, Chin Aik Sun Aixin School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Today, extracting Time Expressions from free text remains a hot topic in Information Retrieval and Natural Processing Language Applications such as question answering systems. Even messaging applications such as WhatsApp are currently extracting time expressions to allow its users to add the dates into the calendar. Extracting time expressions requires the ability to recognize and amend the format into a normalized form. Such systems are called temporal taggers. There are a few temporal taggers and different approaches available on the Internet. In this project, the existing temporal taggers will be evaluated against benchmarks. These benchmarks are datasets which are annotated. This experiment will provide a gauge of the best approach and tagger available on the Web. It will also show the limitations that the taggers have and provide a better solution to solve the limitations. Based on the experiment, we observe that rule-based taggers had better performance compared to the statistical systems. Also, we found out that most of the temporal taggers are unable to recognize colloquial words such as “nw” and “tmrw”. Such words are increasing in everyday use and it will be important for taggers to be able to recognize these words. Also, there were a few ambiguous terms such as a day’s march that taggers incorrectly normalize A few methods to solve the limitations will be shown in the implementation part of the report where the best temporal tagger will be extended. This will provide a description of how future algorithms could do to improve the limitations and performance. Bachelor of Engineering (Computer Science) 2018-04-24T05:33:27Z 2018-04-24T05:33:27Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74079 en Nanyang Technological University 29 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
spellingShingle	DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Chua, Chin Aik Time expression extraction from free text
description	Today, extracting Time Expressions from free text remains a hot topic in Information Retrieval and Natural Processing Language Applications such as question answering systems. Even messaging applications such as WhatsApp are currently extracting time expressions to allow its users to add the dates into the calendar. Extracting time expressions requires the ability to recognize and amend the format into a normalized form. Such systems are called temporal taggers. There are a few temporal taggers and different approaches available on the Internet. In this project, the existing temporal taggers will be evaluated against benchmarks. These benchmarks are datasets which are annotated. This experiment will provide a gauge of the best approach and tagger available on the Web. It will also show the limitations that the taggers have and provide a better solution to solve the limitations. Based on the experiment, we observe that rule-based taggers had better performance compared to the statistical systems. Also, we found out that most of the temporal taggers are unable to recognize colloquial words such as “nw” and “tmrw”. Such words are increasing in everyday use and it will be important for taggers to be able to recognize these words. Also, there were a few ambiguous terms such as a day’s march that taggers incorrectly normalize A few methods to solve the limitations will be shown in the implementation part of the report where the best temporal tagger will be extended. This will provide a description of how future algorithms could do to improve the limitations and performance.
author2	Sun Aixin
author_facet	Sun Aixin Chua, Chin Aik
format	Final Year Project
author	Chua, Chin Aik
author_sort	Chua, Chin Aik
title	Time expression extraction from free text
title_short	Time expression extraction from free text
title_full	Time expression extraction from free text
title_fullStr	Time expression extraction from free text
title_full_unstemmed	Time expression extraction from free text
title_sort	time expression extraction from free text
publishDate	2018
url	http://hdl.handle.net/10356/74079
_version_	1759857385448407040

Time expression extraction from free text

Similar Items