A computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional quantitative trait loci discovery |
|
Author: | Bottolo, Leonardo1,2,3; Banterle, Marco4; Richardson, Sylvia2,3; |
Organizations: |
1Department of Medical Genetics, University of Cambridge, Cambridge, UK 2The Alan Turing Institute, London, UK 3MRC Biostatistics Unit, Cambridge, UK
4Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK
5Computational Medicine, Faculty of Medicine, University of Oulu and Biocenter Oulu, Oulu, Finland 6NMR Metabolomics Laboratory, School of Pharmacy, University of Eastern Finland, Kuopio, Finland 7Center for Life Course Health Research, University of Oulu, Oulu, Finland 8Biocenter Oulu, University of Oulu, Oulu, Finland 9Department of Epidemiology and Biostatistics, Imperial College London, London, UK 10MRC-PHE Centre for Environment and Health, Imperial College London, London, UK 11Department of Life Sciences, Brunel University London, Uxbridge, UK |
Format: | article |
Version: | published version |
Access: | open |
Online Access: | PDF Full Text (PDF, 5.8 MB) |
Persistent link: | http://urn.fi/urn:nbn:fi-fe2021090144889 |
Language: | English |
Published: |
John Wiley & Sons,
2021
|
Publish Date: | 2021-09-01 |
Description: |
AbstractOur work is motivated by the search for metabolite quantitative trait loci (QTL) in a cohort of more than 5000 people. There are 158 metabolites measured by NMR spectroscopy in the 31-year follow-up of the Northern Finland Birth Cohort 1966 (NFBC66). These metabolites, as with many multivariate phenotypes produced by high-throughput biomarker technology, exhibit strong correlation structures. Existing approaches for combining such data with genetic variants for multivariate QTL analysis generally ignore phenotypic correlations or make restrictive assumptions about the associations between phenotypes and genetic loci. We present a computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional data, with cell-sparse variable selection and sparse graphical structure for covariance selection. Cell sparsity allows different phenotype responses to be associated with different genetic predictors and the graphical structure is used to represent the conditional dependencies between phenotype variables. To achieve feasible computation of the large model space, we exploit a factorisation of the covariance matrix. Applying the model to the NFBC66 data with 9000 directly genotyped single nucleotide polymorphisms, we are able to simultaneously estimate genotype–phenotype associations and the residual dependence structure among the metabolites. The R package BayesSUR with full documentation is available at https://cran.r-project.org/web/packages/BayesSUR/ see all
|
Series: |
Journal of the Royal Statistical Society. C, Applied statistics |
ISSN: | 0035-9254 |
ISSN-E: | 1467-9876 |
ISSN-L: | 0035-9254 |
Volume: | 70 |
Issue: | 4 |
Pages: | 886 - 908 |
DOI: | 10.1111/rssc.12490 |
OADOI: | https://oadoi.org/10.1111/rssc.12490 |
Type of Publication: |
A1 Journal article – refereed |
Field of Science: |
3111 Biomedicine |
Subjects: | |
Funding: |
This work was supported by the UK Medical Research Council grant MR/M013138/1 ‘Methods and tools for structural models integrating multiple high-throughput omics data sets in genetic epidemiology’ (AL, LB, MB, MRJ and SR), the European Union Horizon 2020 grant ‘DynaHealth: Understanding the dynamic determinants of glucose homeostasis and social capability to promote healthy and active aging’ grant agreement No 633595 (AL and MRJ), the Medical Research Council grant MC_UP_0801/1 (SR) and The Alan Turing Institute under the Engineering and Physical Sciences Research Council grant EP/N510129/1 (LB and SR). MAK works in a unit that is supported by the University of Bristol and UK Medical Research Council (MC_UU_12013/1). The Baker Institute is supported in part by the Victorian Government’s Operational Infrastructure Support Program. |
EU Grant Number: |
(633595) DYNAHEALTH - Understanding the dynamic determinants of glucose homeostasis and social capability to promote Healthy and active aging |
Copyright information: |
© 2021 The Authors. Journal of the Royal Statistical Society: Series C (Applied Statistics) published by John Wiley & Sons Ltd on behalf of Royal Statistical Society. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
https://creativecommons.org/licenses/by/4.0/ |