Text mining for building proteins interaction networks

Text mining algorithm is an important method for extracting information from biomedical literatures. Precious text mining algorithms are not specific to biological domain. Our purpose is to find an algorithm that is most suitable for biological domain. Changes and improvements have been done to the...

Full description

Saved in:

Bibliographic Details
Main Author:	Chen, Lucan.
Other Authors:	Rajapakse Jagath Chandana
Format:	Final Year Project
Language:	English
Published:	2012
Subjects:	DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Online Access:	http://hdl.handle.net/10356/49070
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-49070
record_format	dspace
spelling	sg-ntu-dr.10356-490702023-03-03T20:44:53Z Text mining for building proteins interaction networks Chen, Lucan. Rajapakse Jagath Chandana School of Computer Engineering Bioinformatics Research Centre DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Text mining algorithm is an important method for extracting information from biomedical literatures. Precious text mining algorithms are not specific to biological domain. Our purpose is to find an algorithm that is most suitable for biological domain. Changes and improvements have been done to the existing algorithms. A new algorithm is also designed. The existing text mining algorithms are investigated and implemented first. Pattern Matching Algorithm is a commonly-used straightforward algorithm. Results from our implementation show that the performance is not good enough. A new algorithm, Terms Association Algorithm is therefore designed and implemented. Results of Terms Association Algorithm show that it’s suitable for English biomedical literature. Its performance is better than the existing text mining algorithm, especially under biological domain. After determining the best text mining algorithm for biological domain, the text mining algorithm was integrated together with OSEE, KEGG pathways and IntAct to construct gene regulation networks and protein-protein interaction networks. The overall performance of the constructed networks is investigated. Our new text mining algorithm hasshow in constructing biological networks. Bachelor of Engineering (Computer Engineering) 2012-05-14T08:00:24Z 2012-05-14T08:00:24Z 2012 2012 Final Year Project (FYP) http://hdl.handle.net/10356/49070 en Nanyang Technological University 49 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
spellingShingle	DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing Chen, Lucan. Text mining for building proteins interaction networks
description	Text mining algorithm is an important method for extracting information from biomedical literatures. Precious text mining algorithms are not specific to biological domain. Our purpose is to find an algorithm that is most suitable for biological domain. Changes and improvements have been done to the existing algorithms. A new algorithm is also designed. The existing text mining algorithms are investigated and implemented first. Pattern Matching Algorithm is a commonly-used straightforward algorithm. Results from our implementation show that the performance is not good enough. A new algorithm, Terms Association Algorithm is therefore designed and implemented. Results of Terms Association Algorithm show that it’s suitable for English biomedical literature. Its performance is better than the existing text mining algorithm, especially under biological domain. After determining the best text mining algorithm for biological domain, the text mining algorithm was integrated together with OSEE, KEGG pathways and IntAct to construct gene regulation networks and protein-protein interaction networks. The overall performance of the constructed networks is investigated. Our new text mining algorithm hasshow in constructing biological networks.
author2	Rajapakse Jagath Chandana
author_facet	Rajapakse Jagath Chandana Chen, Lucan.
format	Final Year Project
author	Chen, Lucan.
author_sort	Chen, Lucan.
title	Text mining for building proteins interaction networks
title_short	Text mining for building proteins interaction networks
title_full	Text mining for building proteins interaction networks
title_fullStr	Text mining for building proteins interaction networks
title_full_unstemmed	Text mining for building proteins interaction networks
title_sort	text mining for building proteins interaction networks
publishDate	2012
url	http://hdl.handle.net/10356/49070
_version_	1759854479009644544

Text mining for building proteins interaction networks

Similar Items