Group-shrinkage feature selection with a spatial network for mining DNA methylation data

Identifying disease-related biomarkers from high-dimensional DNA methylation data helps in reducing early screening costs and inferring pathogenesis mechanisms. Good discovery results have been achieved through spatial correlation methods of methylation sites, group-based regularization, and network...

Full description

Saved in:
Bibliographic Details
Main Authors: Tang, Xinlu, Mo, Zhanfeng, Chang, Cheng, Qian, Xiaohua
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170639
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-170639
record_format dspace
spelling sg-ntu-dr.10356-1706392023-09-25T02:36:47Z Group-shrinkage feature selection with a spatial network for mining DNA methylation data Tang, Xinlu Mo, Zhanfeng Chang, Cheng Qian, Xiaohua School of Computer Science and Engineering Engineering::Computer science and engineering DNA Methylation Feature Selection Identifying disease-related biomarkers from high-dimensional DNA methylation data helps in reducing early screening costs and inferring pathogenesis mechanisms. Good discovery results have been achieved through spatial correlation methods of methylation sites, group-based regularization, and network constraints. However, these methods still have some key limitations as they cannot exclude isolated differential sites and only consider adjacent site ordering. Therefore, we propose a group-shrinkage feature selection algorithm to encourage the selection of clustered sites and discourage the selection of isolated differential sites. Specifically, a network-guided group-shrinkage strategy is developed to penalize weakly-correlated isolated methylation sites through a network structure constraint. The spatial network is constructed based on spatial correlation information of DNA methylation sites, where this information accounts for the uneven site distribution. The experimental simulations and applications demonstrated that the proposed method outperforms the advanced regularization methods, especially in rejecting isolated methylation sites; hence this study provides an efficient and clinical-valuable method for biomarker candidate discovery in DNA methylation data. Additionally, the proposed method exhibits enhanced reliability due to introducing biological prior knowledge into a regularization-based feature selection framework and could promote more research in the integration between biological prior knowledge and classical feature selection methods, thus facilitating their clinical application. Our source codes will be released at https://github.com/SJTUBME-QianLab/Group-shrinkage-Spatial-Network once this manuscript is accepted for publication. 2023-09-25T02:36:47Z 2023-09-25T02:36:47Z 2023 Journal Article Tang, X., Mo, Z., Chang, C. & Qian, X. (2023). Group-shrinkage feature selection with a spatial network for mining DNA methylation data. Computers in Biology and Medicine, 154, 106573-. https://dx.doi.org/10.1016/j.compbiomed.2023.106573 0010-4825 https://hdl.handle.net/10356/170639 10.1016/j.compbiomed.2023.106573 36706568 2-s2.0-85147326888 154 106573 en Computers in Biology and Medicine © 2023 Elsevier Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
DNA Methylation
Feature Selection
spellingShingle Engineering::Computer science and engineering
DNA Methylation
Feature Selection
Tang, Xinlu
Mo, Zhanfeng
Chang, Cheng
Qian, Xiaohua
Group-shrinkage feature selection with a spatial network for mining DNA methylation data
description Identifying disease-related biomarkers from high-dimensional DNA methylation data helps in reducing early screening costs and inferring pathogenesis mechanisms. Good discovery results have been achieved through spatial correlation methods of methylation sites, group-based regularization, and network constraints. However, these methods still have some key limitations as they cannot exclude isolated differential sites and only consider adjacent site ordering. Therefore, we propose a group-shrinkage feature selection algorithm to encourage the selection of clustered sites and discourage the selection of isolated differential sites. Specifically, a network-guided group-shrinkage strategy is developed to penalize weakly-correlated isolated methylation sites through a network structure constraint. The spatial network is constructed based on spatial correlation information of DNA methylation sites, where this information accounts for the uneven site distribution. The experimental simulations and applications demonstrated that the proposed method outperforms the advanced regularization methods, especially in rejecting isolated methylation sites; hence this study provides an efficient and clinical-valuable method for biomarker candidate discovery in DNA methylation data. Additionally, the proposed method exhibits enhanced reliability due to introducing biological prior knowledge into a regularization-based feature selection framework and could promote more research in the integration between biological prior knowledge and classical feature selection methods, thus facilitating their clinical application. Our source codes will be released at https://github.com/SJTUBME-QianLab/Group-shrinkage-Spatial-Network once this manuscript is accepted for publication.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Tang, Xinlu
Mo, Zhanfeng
Chang, Cheng
Qian, Xiaohua
format Article
author Tang, Xinlu
Mo, Zhanfeng
Chang, Cheng
Qian, Xiaohua
author_sort Tang, Xinlu
title Group-shrinkage feature selection with a spatial network for mining DNA methylation data
title_short Group-shrinkage feature selection with a spatial network for mining DNA methylation data
title_full Group-shrinkage feature selection with a spatial network for mining DNA methylation data
title_fullStr Group-shrinkage feature selection with a spatial network for mining DNA methylation data
title_full_unstemmed Group-shrinkage feature selection with a spatial network for mining DNA methylation data
title_sort group-shrinkage feature selection with a spatial network for mining dna methylation data
publishDate 2023
url https://hdl.handle.net/10356/170639
_version_ 1779156307297697792