BayesW time-to-event analysis posterior outputs and summary statistics
Data files
Mar 03, 2021 version files 39.94 GB
-
ReadMe_BayesWData.md
5.70 KB
-
summary_UK_CAD_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N12_n8.tar.gz
124.76 MB
-
summary_UK_HBP_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N12_n8.tar.gz
124.58 MB
-
summary_UK_menarche_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N10_n8.tar.gz
124.16 MB
-
summary_UK_menopause_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N8_n8.tar.gz
123.67 MB
-
summary_UK_T2D_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N12_n8.tar.gz
124.38 MB
-
UK_CAD_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N12_n8.tar.gz
7.33 GB
-
UK_HBP_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N12_n8.tar.gz
7.39 GB
-
UK_menarche_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N10_n8.tar.gz
11.81 GB
-
UK_menopause_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N8_n8.tar.gz
7.18 GB
-
UK_T2D_unrel_LD09_00001_4_GMAF5_LD4_q25_sR10_N12_n8.tar.gz
5.60 GB
Abstract
Here, we develop a Bayesian approach (BayesW) that provides probabilistic inference of the genetic architecture of age-at-onset phenotypes in a hybrid-parallel sampling scheme that facilitates Bayesian time-to-event large-scale biobank analyses. We show in extensive simulation work that BayesW achieves a greater number of discoveries, better model performance and improved genomic prediction as compared to other approaches. In the UK Biobank, we find many thousands of common genomic regions underlying the age-at-onset of high blood pressure (HBP), cardiac disease (CAD), and type-2 diabetes (T2D), and for the genetic basis of onset reflecting the underlying genetic liability to disease. Age-at-menopause and age-at-menarche are also highly polygenic, but with higher variance contributed by low-frequency variants. Genomic prediction into the Estonian Biobank data shows that BayesW gives higher prediction accuracy than other approaches.
Methods
The data consists of the posterior distributions of running BayesW model on five phenotypes in UK Biobank: age-at-menopause, age-at-menarche, and age-at-diagnosis of coronary artery disease, high blood pressure, or type-2-diabetes. The posterior distributions give the effect size estimates and the corresponding effect size mixture classification for each of the markers analysed.
In addition we provide summary statistics for each of the markers analysed, giving posterior means, standard deviations and inclusion probabilities.
Usage notes
The information included in the ReadMe file.