BioDARA: data summarization approach to extracting bio-medical structuring information

Problem statement: Due to the ever growing amount of biomedical datasets stored in multiple tables, Information Extraction (IE) from these datasets is increasingly recognized as one of the crucial technologies in bioinformatics. However, for IE to be practically applicable, adaptability of a system...

Full description

Saved in:
Bibliographic Details
Main Authors: Chung Seng Kheau, Rayner Alfred, Joe Henry Obit
Format: Article
Language:English
English
Published: Science Publications 2011
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/29060/1/BioDARA_data%20summarization%20approach%20to%20extracting%20bio-medical%20structuring%20information%20ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/29060/2/BioDARA_%20data%20summarization%20approach%20to%20extracting%20bio-medical%20structuring%20information%20FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/29060/
http://thescipub.com/abstract/10.3844/jcssp.2011.1914.1920
https://doi.org/10.3844/jcssp.2011.1914.1920
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sabah
Language: English
English
id my.ums.eprints.29060
record_format eprints
spelling my.ums.eprints.290602021-09-20T01:51:06Z https://eprints.ums.edu.my/id/eprint/29060/ BioDARA: data summarization approach to extracting bio-medical structuring information Chung Seng Kheau Rayner Alfred Joe Henry Obit R856-857 Biomedical engineering. Electronics. Instrumentation Problem statement: Due to the ever growing amount of biomedical datasets stored in multiple tables, Information Extraction (IE) from these datasets is increasingly recognized as one of the crucial technologies in bioinformatics. However, for IE to be practically applicable, adaptability of a system is crucial, considering extremely diverse demands in biomedical IE application. One should be able to extract a set of hidden patterns from these biomedical datasets at low cost. Approach: In this study, a new method is proposed, called Bio-medical Data Aggregation for Relational Attributes (BioDARA), for automatic structuring information extraction for biomedical datasets. BioDARA summarizes biomedical data stored in multiple tables in order to facilitate data modeling efforts in a multi-relational setting. BioDARA has the advantages or capabilities to transform biomedical data stored in multiple tables or databases into a Vector Space model, summarize biomedical data using the Information Retrieval theory and finally extract frequent patterns that describe the characteristics of these biomedical datasets. Results: the results show that data summarization performed by DARA, can be beneficial in summarizing biomedical datasets in a complex multi-relational environment, in which biomedical datasets are stored in a multi-level of one-to-many relationships and also in the case of datasets stored in more than one one-to-many relationships with non-target tables. Conclusion: This study concludes that data summarization performed by BioDARA, can be beneficial in summarizing biomedical datasets in a complex multi-relational environment, in which biomedical datasets are stored in a multi-level of one-to-many relationships. Science Publications 2011 Article PeerReviewed text en https://eprints.ums.edu.my/id/eprint/29060/1/BioDARA_data%20summarization%20approach%20to%20extracting%20bio-medical%20structuring%20information%20ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/29060/2/BioDARA_%20data%20summarization%20approach%20to%20extracting%20bio-medical%20structuring%20information%20FULL%20TEXT.pdf Chung Seng Kheau and Rayner Alfred and Joe Henry Obit (2011) BioDARA: data summarization approach to extracting bio-medical structuring information. Journal of Computer Science, 7. pp. 1914-1920. ISSN 1549-3636 (P-ISSN) ,1552-6607 (E-ISSN) http://thescipub.com/abstract/10.3844/jcssp.2011.1914.1920 https://doi.org/10.3844/jcssp.2011.1914.1920
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
English
topic R856-857 Biomedical engineering. Electronics. Instrumentation
spellingShingle R856-857 Biomedical engineering. Electronics. Instrumentation
Chung Seng Kheau
Rayner Alfred
Joe Henry Obit
BioDARA: data summarization approach to extracting bio-medical structuring information
description Problem statement: Due to the ever growing amount of biomedical datasets stored in multiple tables, Information Extraction (IE) from these datasets is increasingly recognized as one of the crucial technologies in bioinformatics. However, for IE to be practically applicable, adaptability of a system is crucial, considering extremely diverse demands in biomedical IE application. One should be able to extract a set of hidden patterns from these biomedical datasets at low cost. Approach: In this study, a new method is proposed, called Bio-medical Data Aggregation for Relational Attributes (BioDARA), for automatic structuring information extraction for biomedical datasets. BioDARA summarizes biomedical data stored in multiple tables in order to facilitate data modeling efforts in a multi-relational setting. BioDARA has the advantages or capabilities to transform biomedical data stored in multiple tables or databases into a Vector Space model, summarize biomedical data using the Information Retrieval theory and finally extract frequent patterns that describe the characteristics of these biomedical datasets. Results: the results show that data summarization performed by DARA, can be beneficial in summarizing biomedical datasets in a complex multi-relational environment, in which biomedical datasets are stored in a multi-level of one-to-many relationships and also in the case of datasets stored in more than one one-to-many relationships with non-target tables. Conclusion: This study concludes that data summarization performed by BioDARA, can be beneficial in summarizing biomedical datasets in a complex multi-relational environment, in which biomedical datasets are stored in a multi-level of one-to-many relationships.
format Article
author Chung Seng Kheau
Rayner Alfred
Joe Henry Obit
author_facet Chung Seng Kheau
Rayner Alfred
Joe Henry Obit
author_sort Chung Seng Kheau
title BioDARA: data summarization approach to extracting bio-medical structuring information
title_short BioDARA: data summarization approach to extracting bio-medical structuring information
title_full BioDARA: data summarization approach to extracting bio-medical structuring information
title_fullStr BioDARA: data summarization approach to extracting bio-medical structuring information
title_full_unstemmed BioDARA: data summarization approach to extracting bio-medical structuring information
title_sort biodara: data summarization approach to extracting bio-medical structuring information
publisher Science Publications
publishDate 2011
url https://eprints.ums.edu.my/id/eprint/29060/1/BioDARA_data%20summarization%20approach%20to%20extracting%20bio-medical%20structuring%20information%20ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/29060/2/BioDARA_%20data%20summarization%20approach%20to%20extracting%20bio-medical%20structuring%20information%20FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/29060/
http://thescipub.com/abstract/10.3844/jcssp.2011.1914.1920
https://doi.org/10.3844/jcssp.2011.1914.1920
_version_ 1760230665576513536