GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction

Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or do...

Full description

Saved in:
Bibliographic Details
Main Authors: Yu, Shiong Wong, Nung, Kion Lee, Norshafarina, Omar
Format: E-Article
Language:English
Published: ACM New York, NY, USA 2016
Subjects:
Online Access:http://ir.unimas.my/id/eprint/16666/1/GMFR-CNN%20%28abstract%29.pdf
http://ir.unimas.my/id/eprint/16666/
http://dl.acm.org/citation.cfm?id=3029380
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sarawak
Language: English
id my.unimas.ir.16666
record_format eprints
spelling my.unimas.ir.166662017-06-15T07:03:04Z http://ir.unimas.my/id/eprint/16666/ GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction Yu, Shiong Wong Nung, Kion Lee Norshafarina, Omar T Technology (General) Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or downstream thousands of base pairs away without any fixed position. Therefore, the identification task of enhancers is extremely challenging. The inclusion of gaps in motif identification improved the overall accuracy and sensitivity, however, this feature is not fully utilised in deep learning method yet. Deep learning, is a powerful machine learning technique that has been actively used in image recognition and this technique has begun to shed light in bioinformatics. The expressiveness of deep learning enables higher feature learning from lower level ones. As a result, an integration of gapped motif feature representation (GMFR) and deep learning approach called deep convolutional neural networks (CNNs) is introduced to improve the predictive power of enhancer prediction. We called this method as GMFR-CNN. Comparative studies indicate that GMFR-CNN outperforms the other deep learning and gapped k-mer SVM tools with average 98% prediction accuracy. Breakthrough in deep learning technique certainly improves the performance in the near future. ACM New York, NY, USA 2016 E-Article PeerReviewed text en http://ir.unimas.my/id/eprint/16666/1/GMFR-CNN%20%28abstract%29.pdf Yu, Shiong Wong and Nung, Kion Lee and Norshafarina, Omar (2016) GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction. CSBio '16 Proceedings of the 7th International Conference on Computational Systems-Biology and Bioinformatics. ISSN ISBN: 978-1-4503-4794-5 http://dl.acm.org/citation.cfm?id=3029380 10.1145/3029375.3029380
institution Universiti Malaysia Sarawak
building Centre for Academic Information Services (CAIS)
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sarawak
content_source UNIMAS Institutional Repository
url_provider http://ir.unimas.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Yu, Shiong Wong
Nung, Kion Lee
Norshafarina, Omar
GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
description Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or downstream thousands of base pairs away without any fixed position. Therefore, the identification task of enhancers is extremely challenging. The inclusion of gaps in motif identification improved the overall accuracy and sensitivity, however, this feature is not fully utilised in deep learning method yet. Deep learning, is a powerful machine learning technique that has been actively used in image recognition and this technique has begun to shed light in bioinformatics. The expressiveness of deep learning enables higher feature learning from lower level ones. As a result, an integration of gapped motif feature representation (GMFR) and deep learning approach called deep convolutional neural networks (CNNs) is introduced to improve the predictive power of enhancer prediction. We called this method as GMFR-CNN. Comparative studies indicate that GMFR-CNN outperforms the other deep learning and gapped k-mer SVM tools with average 98% prediction accuracy. Breakthrough in deep learning technique certainly improves the performance in the near future.
format E-Article
author Yu, Shiong Wong
Nung, Kion Lee
Norshafarina, Omar
author_facet Yu, Shiong Wong
Nung, Kion Lee
Norshafarina, Omar
author_sort Yu, Shiong Wong
title GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
title_short GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
title_full GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
title_fullStr GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
title_full_unstemmed GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
title_sort gmfr-cnn: an integration of gapped motif feature representation and deep learning approach for enhancer prediction
publisher ACM New York, NY, USA
publishDate 2016
url http://ir.unimas.my/id/eprint/16666/1/GMFR-CNN%20%28abstract%29.pdf
http://ir.unimas.my/id/eprint/16666/
http://dl.acm.org/citation.cfm?id=3029380
_version_ 1644512424011759616