Term recognition from electronic medical records of Singaporean hospitals

Unlike other Medical text, Electronic Medical Records (EMRs) are written by doctors in a clinical setting. As EMRs are relatively new themselves, they lack well defined standards for input and doctors usually write them in free-form dictation These form of text lack proper sentence structure and con...

Full description

Saved in:
Bibliographic Details
Main Author: Chai, De Ren
Other Authors: Kim Jung-Jae
Format: Final Year Project
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/62808
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-62808
record_format dspace
spelling sg-ntu-dr.10356-628082023-03-03T20:50:39Z Term recognition from electronic medical records of Singaporean hospitals Chai, De Ren Kim Jung-Jae School of Computer Engineering Khoo Teck Puat Hospital DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Unlike other Medical text, Electronic Medical Records (EMRs) are written by doctors in a clinical setting. As EMRs are relatively new themselves, they lack well defined standards for input and doctors usually write them in free-form dictation These form of text lack proper sentence structure and contain numerous acronyms that only appear in such context, making it difficult to apply current parsing techniques to discover semantic meanings of the text. In this project, methods to identify unknown acronym terms in EMRs and disambiguate their meaning are explored and developed. To better apply automated processes, the EMRs are first pre-processed to restructure their text into a more defined format. Finally these unknown acronym terms are marked up for future use. Results obtained from the formatting of the EMRs showed a precision of 100% and a recall of 67.93%. Initial extraction of acronym-value pairs from the EMRs resulted in 2329 extracted terms with 68.14% true positives, the implementation of an exclusion list, consisting of recurring false positives increased the true positives to 90.99%. While the automated disambiguation process of these true positives could not be implemented, distinct patterns for 31 acronyms were identified which can be used to disambiguate their occurrence in EMRs. Bachelor of Engineering (Computer Science) 2015-04-29T04:48:47Z 2015-04-29T04:48:47Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62808 en Nanyang Technological University 52 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Chai, De Ren
Term recognition from electronic medical records of Singaporean hospitals
description Unlike other Medical text, Electronic Medical Records (EMRs) are written by doctors in a clinical setting. As EMRs are relatively new themselves, they lack well defined standards for input and doctors usually write them in free-form dictation These form of text lack proper sentence structure and contain numerous acronyms that only appear in such context, making it difficult to apply current parsing techniques to discover semantic meanings of the text. In this project, methods to identify unknown acronym terms in EMRs and disambiguate their meaning are explored and developed. To better apply automated processes, the EMRs are first pre-processed to restructure their text into a more defined format. Finally these unknown acronym terms are marked up for future use. Results obtained from the formatting of the EMRs showed a precision of 100% and a recall of 67.93%. Initial extraction of acronym-value pairs from the EMRs resulted in 2329 extracted terms with 68.14% true positives, the implementation of an exclusion list, consisting of recurring false positives increased the true positives to 90.99%. While the automated disambiguation process of these true positives could not be implemented, distinct patterns for 31 acronyms were identified which can be used to disambiguate their occurrence in EMRs.
author2 Kim Jung-Jae
author_facet Kim Jung-Jae
Chai, De Ren
format Final Year Project
author Chai, De Ren
author_sort Chai, De Ren
title Term recognition from electronic medical records of Singaporean hospitals
title_short Term recognition from electronic medical records of Singaporean hospitals
title_full Term recognition from electronic medical records of Singaporean hospitals
title_fullStr Term recognition from electronic medical records of Singaporean hospitals
title_full_unstemmed Term recognition from electronic medical records of Singaporean hospitals
title_sort term recognition from electronic medical records of singaporean hospitals
publishDate 2015
url http://hdl.handle.net/10356/62808
_version_ 1759857223431880704