Integration of genomic and epigenomic features to predict meiotic recombination hotspots in human and mouse

The regulatory mechanism of meiotic recombination hotspots is a fundamental problem in biology, with broad impacts on areas ranging from disease study to evolution. Recently, many genomic and epigenomic features have been associated with recombination hotspots, but none of them can explain hotspots...

Full description

Saved in:
Bibliographic Details
Main Authors: Przytycka, Teresa M., Wu, Min, Kwoh, Chee Keong, Li, Jing, Zheng, Jie
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/97678
http://hdl.handle.net/10220/11871
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:The regulatory mechanism of meiotic recombination hotspots is a fundamental problem in biology, with broad impacts on areas ranging from disease study to evolution. Recently, many genomic and epigenomic features have been associated with recombination hotspots, but none of them can explain hotspots consistently. It is highly desirable to integrate the different features into a predictive model, and study the relation of the features with hotspots and themselves with a systems approach. Moreover, due to rapid and dynamic evolution of recombination hotspots, regulatory mechanisms of hotspots that are evolutionarily conserved among species remain unclear. We propose a machine learning approach that encode genomic and epigenomic features into a support vector machine (SVM). Trained on known hotspots and coldspots in human and mouse genomes, the model is able to predict hotspots based on the features with good performance in both species. Moreover, the model reports a ranking of feature importance, uncovering the interactions of the features with hotspots and themselves. Applying the method to large-scale data, we identified evolutionarily conserved patterns of trans-regulators and feature importance between human and mouse hotspots. This is the first attempt to build a predictive model to identify evolutionarily conserved mechanisms for recombination hotspots by integrating both genomic and epigenomic features.