Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data

Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always c...

Full description

Saved in:
Bibliographic Details
Main Authors: Jakris Eu-ahsunthornwattana, E. Nancy Miller, Michaela Fakiola, Selma M.B. Jeronimo, Jenefer M. Blackwell, Heather J. Cordell
Other Authors: Newcastle University, United Kingdom
Format: Article
Published: 2018
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/33170
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.33170
record_format dspace
spelling th-mahidol.331702018-11-09T10:04:37Z Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data Jakris Eu-ahsunthornwattana E. Nancy Miller Michaela Fakiola Selma M.B. Jeronimo Jenefer M. Blackwell Heather J. Cordell Newcastle University, United Kingdom Mahidol University Cambridge Institute for Medical Research Universidade Federal do Rio Grande do Norte Telethon Kids Institute Agricultural and Biological Sciences Biochemistry, Genetics and Molecular Biology Medicine Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be based on considerations such as speed and ease-of-use. © 2014 Eu-ahsunthornwattana et al. 2018-11-09T01:48:37Z 2018-11-09T01:48:37Z 2014-01-01 Article PLoS Genetics. Vol.10, No.7 (2014) 10.1371/journal.pgen.1004445 15537404 15537390 2-s2.0-84905455421 https://repository.li.mahidol.ac.th/handle/123456789/33170 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84905455421&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Agricultural and Biological Sciences
Biochemistry, Genetics and Molecular Biology
Medicine
spellingShingle Agricultural and Biological Sciences
Biochemistry, Genetics and Molecular Biology
Medicine
Jakris Eu-ahsunthornwattana
E. Nancy Miller
Michaela Fakiola
Selma M.B. Jeronimo
Jenefer M. Blackwell
Heather J. Cordell
Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
description Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be based on considerations such as speed and ease-of-use. © 2014 Eu-ahsunthornwattana et al.
author2 Newcastle University, United Kingdom
author_facet Newcastle University, United Kingdom
Jakris Eu-ahsunthornwattana
E. Nancy Miller
Michaela Fakiola
Selma M.B. Jeronimo
Jenefer M. Blackwell
Heather J. Cordell
format Article
author Jakris Eu-ahsunthornwattana
E. Nancy Miller
Michaela Fakiola
Selma M.B. Jeronimo
Jenefer M. Blackwell
Heather J. Cordell
author_sort Jakris Eu-ahsunthornwattana
title Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_short Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_full Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_fullStr Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_full_unstemmed Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_sort comparison of methods to account for relatedness in genome-wide association studies with family-based data
publishDate 2018
url https://repository.li.mahidol.ac.th/handle/123456789/33170
_version_ 1763487895963828224