Time expression recognition and normalization: a survey

Time information plays an important role in the areas of data mining, information retrieval, and natural language processing. Among the linguistic tasks related to time expressions, time expression recognition and normalization (TERN) is fundamental for other downstream tasks. Researchers from these...

Bibliographic Details
Main Authors: Zhong, Xiaoshi; Cambria, Erik
Other Authors: School of Computer Science and Engineering
Format: Article
Language: English
Published: 2023
Subjects: Engineering::Computer science and engineering; Information Extraction; Time Expressions
Online Access: https://hdl.handle.net/10356/168979
Institution: Nanyang Technological University
id sg-ntu-dr.10356-168979
record_format dspace
spelling sg-ntu-dr.10356-168979 2023-06-26T02:48:31Z
Time expression recognition and normalization: a survey
Zhong, Xiaoshi
Cambria, Erik
School of Computer Science and Engineering
Engineering::Computer science and engineering
Information Extraction
Time Expressions
Agency for Science, Technology and Research (A*STAR)
This research is supported by the Agency for Science, Technology and Research (A*STAR) under its AME Programmatic Funding Scheme (Project #A18A2b0046).
2023-06-26T02:48:31Z
2023-06-26T02:48:31Z
2023
Journal Article
Zhong, X. & Cambria, E. (2023). Time expression recognition and normalization: a survey. Artificial Intelligence Review. https://dx.doi.org/10.1007/s10462-023-10400-y
0269-2821
https://hdl.handle.net/10356/168979
10.1007/s10462-023-10400-y
2-s2.0-85146820474
en
A18A2b0046
Artificial Intelligence Review
© The Author(s), under exclusive licence to Springer Nature B.V. 2023.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Information Extraction
Time Expressions
description Time information plays an important role in data mining, information retrieval, and natural language processing. Among the linguistic tasks related to time expressions, time expression recognition and normalization (TERN) is fundamental to other downstream tasks. Over the last two decades, researchers from these areas have devoted considerable effort to defining the problem of time expression analysis, designing standards for time expression annotation, building annotated corpora for time expressions, and developing methods to identify time expressions in free text. While some surveys cover the development of time information extraction, retrieval, and reasoning, to the best of our knowledge no survey focuses on the development of TERN. We fill this gap. In this survey, we review previous research to give an overview of the development of time expression analysis and discuss the role that time expressions play in different areas. We focus on the task of recognizing and normalizing time expressions in free text and investigate three kinds of methods that researchers have developed for TERN, namely rule-based methods, traditional machine-learning methods, and deep-learning methods. We also discuss several factors relevant to TERN development, including TIMEX type, language, and domain and textual factors. After that, we list useful datasets and software for time expression recognition (TER) and time expression normalization (TEN) individually, as well as for TERN, and finally outline some potential directions for future research. We hope that this survey helps researchers who are interested in TERN quickly gain a comprehensive understanding of its development and its potential research directions.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Zhong, Xiaoshi
Cambria, Erik
format Article
author Zhong, Xiaoshi
Cambria, Erik
author_sort Zhong, Xiaoshi
title Time expression recognition and normalization: a survey
publishDate 2023
url https://hdl.handle.net/10356/168979