Hybrid pattern matching for complex ontology term recognition

Ontology term recognition is a key task of ontology-based text mining. Previous approaches of statistical analysis and syntactic pattern matching have such limitations that they do not consider relations between words and that their handcrafted patterns are expensive and show low coverage, respectiv...

Full description

Saved in:
Bibliographic Details
Main Authors: Kim, Jung-jae., Tuan, Luu Anh.
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/97427
http://hdl.handle.net/10220/11870
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-97427
record_format dspace
spelling sg-ntu-dr.10356-974272020-05-28T07:17:54Z Hybrid pattern matching for complex ontology term recognition Kim, Jung-jae. Tuan, Luu Anh. School of Computer Engineering Conference on Bioinformatics, Computational Biology and Biomedicine (2012 : Orlando, USA) DRNTU::Engineering::Computer science and engineering Ontology term recognition is a key task of ontology-based text mining. Previous approaches of statistical analysis and syntactic pattern matching have such limitations that they do not consider relations between words and that their handcrafted patterns are expensive and show low coverage, respectively. These limitations are critical especially when dealing with long and complex ontology terms. We propose a hybrid approach that combines the two approaches sequentially: It first uses syntactic pattern matching and, when its results are partial due to lack of required patterns, then completes them with supplementary evidence from a statistical method. Additionally, we present a novel method that automatically learns syntactic patterns from an annotated corpus. We tested the proposed approach for the tasks of recognizing Gene Ontology (GO) terms in text and also of associating the GO terms with proteins. When compared with existing systems of statistical analysis and syntactic pattern matching, it significantly improves 'relative' recall by 11%~13% and F-score by 7%. 2013-07-18T06:04:49Z 2019-12-06T19:42:40Z 2013-07-18T06:04:49Z 2019-12-06T19:42:40Z 2012 2012 Conference Paper Kim, J.-j., & Tuan, L. A. (2012). Hybrid pattern matching for complex ontology term recognition. Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine - BCB '12. https://hdl.handle.net/10356/97427 http://hdl.handle.net/10220/11870 10.1145/2382936.2382973 en © 2012 ACM.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Kim, Jung-jae.
Tuan, Luu Anh.
Hybrid pattern matching for complex ontology term recognition
description Ontology term recognition is a key task of ontology-based text mining. Previous approaches of statistical analysis and syntactic pattern matching have such limitations that they do not consider relations between words and that their handcrafted patterns are expensive and show low coverage, respectively. These limitations are critical especially when dealing with long and complex ontology terms. We propose a hybrid approach that combines the two approaches sequentially: It first uses syntactic pattern matching and, when its results are partial due to lack of required patterns, then completes them with supplementary evidence from a statistical method. Additionally, we present a novel method that automatically learns syntactic patterns from an annotated corpus. We tested the proposed approach for the tasks of recognizing Gene Ontology (GO) terms in text and also of associating the GO terms with proteins. When compared with existing systems of statistical analysis and syntactic pattern matching, it significantly improves 'relative' recall by 11%~13% and F-score by 7%.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Kim, Jung-jae.
Tuan, Luu Anh.
format Conference or Workshop Item
author Kim, Jung-jae.
Tuan, Luu Anh.
author_sort Kim, Jung-jae.
title Hybrid pattern matching for complex ontology term recognition
title_short Hybrid pattern matching for complex ontology term recognition
title_full Hybrid pattern matching for complex ontology term recognition
title_fullStr Hybrid pattern matching for complex ontology term recognition
title_full_unstemmed Hybrid pattern matching for complex ontology term recognition
title_sort hybrid pattern matching for complex ontology term recognition
publishDate 2013
url https://hdl.handle.net/10356/97427
http://hdl.handle.net/10220/11870
_version_ 1681058821693243392