GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction
Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or do...
Saved in:
Main Authors: | , , |
---|---|
Format: | E-Article |
Language: | English |
Published: |
ACM New York, NY, USA
2016
|
Subjects: | |
Online Access: | http://ir.unimas.my/id/eprint/16666/1/GMFR-CNN%20%28abstract%29.pdf http://ir.unimas.my/id/eprint/16666/ http://dl.acm.org/citation.cfm?id=3029380 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaysia Sarawak |
Language: | English |
id |
my.unimas.ir.16666 |
---|---|
record_format |
eprints |
spelling |
my.unimas.ir.166662017-06-15T07:03:04Z http://ir.unimas.my/id/eprint/16666/ GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction Yu, Shiong Wong Nung, Kion Lee Norshafarina, Omar T Technology (General) Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or downstream thousands of base pairs away without any fixed position. Therefore, the identification task of enhancers is extremely challenging. The inclusion of gaps in motif identification improved the overall accuracy and sensitivity, however, this feature is not fully utilised in deep learning method yet. Deep learning, is a powerful machine learning technique that has been actively used in image recognition and this technique has begun to shed light in bioinformatics. The expressiveness of deep learning enables higher feature learning from lower level ones. As a result, an integration of gapped motif feature representation (GMFR) and deep learning approach called deep convolutional neural networks (CNNs) is introduced to improve the predictive power of enhancer prediction. We called this method as GMFR-CNN. Comparative studies indicate that GMFR-CNN outperforms the other deep learning and gapped k-mer SVM tools with average 98% prediction accuracy. Breakthrough in deep learning technique certainly improves the performance in the near future. ACM New York, NY, USA 2016 E-Article PeerReviewed text en http://ir.unimas.my/id/eprint/16666/1/GMFR-CNN%20%28abstract%29.pdf Yu, Shiong Wong and Nung, Kion Lee and Norshafarina, Omar (2016) GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction. CSBio '16 Proceedings of the 7th International Conference on Computational Systems-Biology and Bioinformatics. ISSN ISBN: 978-1-4503-4794-5 http://dl.acm.org/citation.cfm?id=3029380 10.1145/3029375.3029380 |
institution |
Universiti Malaysia Sarawak |
building |
Centre for Academic Information Services (CAIS) |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sarawak |
content_source |
UNIMAS Institutional Repository |
url_provider |
http://ir.unimas.my/ |
language |
English |
topic |
T Technology (General) |
spellingShingle |
T Technology (General) Yu, Shiong Wong Nung, Kion Lee Norshafarina, Omar GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction |
description |
Unravelling gene expression has become a critical procedure in bioinformatics world today and required continuous efforts to form a complete picture of enhancers. Enhancers are explicit patterns of gene expression that bound by activators to stimulate transcription. It could reside in upstream or downstream thousands of base pairs away without any fixed position. Therefore, the identification task of enhancers is extremely challenging. The inclusion of gaps in motif identification improved the overall accuracy and sensitivity, however, this feature is not fully utilised in deep learning method yet. Deep learning, is a powerful machine learning technique that has been actively used in image recognition and this technique has begun to shed light in bioinformatics. The expressiveness of deep learning enables higher feature learning from lower level ones. As a result, an integration of gapped motif feature representation (GMFR) and deep learning approach called deep convolutional neural networks (CNNs) is introduced to improve the predictive power of enhancer prediction. We called this method as GMFR-CNN. Comparative studies indicate that GMFR-CNN outperforms the other deep learning and gapped k-mer SVM tools with average 98% prediction accuracy. Breakthrough in deep learning technique certainly improves the performance in the near future. |
format |
E-Article |
author |
Yu, Shiong Wong Nung, Kion Lee Norshafarina, Omar |
author_facet |
Yu, Shiong Wong Nung, Kion Lee Norshafarina, Omar |
author_sort |
Yu, Shiong Wong |
title |
GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction |
title_short |
GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction |
title_full |
GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction |
title_fullStr |
GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction |
title_full_unstemmed |
GMFR-CNN: An Integration of Gapped Motif Feature Representation and Deep Learning Approach for Enhancer Prediction |
title_sort |
gmfr-cnn: an integration of gapped motif feature representation and deep learning approach for enhancer prediction |
publisher |
ACM New York, NY, USA |
publishDate |
2016 |
url |
http://ir.unimas.my/id/eprint/16666/1/GMFR-CNN%20%28abstract%29.pdf http://ir.unimas.my/id/eprint/16666/ http://dl.acm.org/citation.cfm?id=3029380 |
_version_ |
1644512424011759616 |