Data from: Accounting for heteroscedasticity and censoring in chromosome partitioning analyses

Kemppainen, Petri; Husby, Arild

Published Nov 07, 2018 on Dryad. https://doi.org/10.5061/dryad.d6736c7

Data files

Nov 07, 2018 version files (12.58 GB total)

data_NG_100.tar.gz (1.28 GB)
data_NG_1000_2.tar.gz (6.29 GB)
data_NG_pop_struct.tar.gz (299.86 MB)
data_sim_vs_emp.tar.gz (4.39 GB)
R_project_folder.tar.gz (312.85 MB)
README_for_R_project_folder.tar.txt (1.38 KB)

Abstract

A fundamental assumption in quantitative genetics is that traits are controlled by many loci of small effect. Using genomic data, this assumption can be tested with chromosome partitioning analyses, in which the proportion of genetic variance for a trait explained by each chromosome (h2c) is regressed on chromosome size. However, because h2c estimates are necessarily positive (censoring) and their variance increases with chromosome size (heteroscedasticity), two fundamental assumptions of ordinary least squares (OLS) regression are violated. Using simulated and empirical data, we demonstrate that these violations lead to incorrect inference of genetic architecture. The degree of bias depends mainly on the number of chromosomes and their size distribution and is therefore species-specific; using published data across many different species, we estimate that not accounting for this effect resulted in 28% false positives overall. We introduce a new and computationally efficient resampling method that corrects for the inflation caused by heteroscedasticity and censoring and that works under a wide range of data set sizes and genetic architectures in empirical data sets. Our new method substantially improves the robustness of inferences from chromosome partitioning analyses.
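The interaction between censoring and heteroscedasticity described in the abstract can be illustrated with a small null simulation. The sketch below is not the authors' resampling method and does not use the deposited data. It assumes a true h2c of zero on every chromosome, draws each estimate from a normal distribution whose standard deviation is proportional to chromosome size, and compares the mean OLS slope with and without censoring at zero. The chromosome sizes, the noise scale, and the function names `ols_slope` and `simulate_null_slopes` are all hypothetical choices made for illustration.

```python
import random
import statistics


def ols_slope(x, y):
    """Ordinary least squares slope of y regressed on x."""
    mean_x = statistics.fmean(x)
    mean_y = statistics.fmean(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx


def simulate_null_slopes(sizes, noise_per_unit, n_reps, censor, rng):
    """OLS slopes from repeated null simulations (true h2c is 0 everywhere).

    Each estimate is drawn from Normal(0, noise_per_unit * size), so the
    sampling variance grows with chromosome size (heteroscedasticity).
    With censor=True, negative estimates are set to 0 (censoring).
    """
    slopes = []
    for _ in range(n_reps):
        h2c = [rng.gauss(0.0, noise_per_unit * s) for s in sizes]
        if censor:
            h2c = [max(0.0, v) for v in h2c]
        slopes.append(ols_slope(sizes, h2c))
    return slopes


# Hypothetical chromosome sizes in arbitrary units (assumption, not from the data).
sizes = [10, 20, 30, 40, 60, 80, 100, 150, 200, 250]
rng = random.Random(1)  # fixed seed so the sketch is reproducible

uncensored = simulate_null_slopes(sizes, 0.0005, 2000, censor=False, rng=rng)
censored = simulate_null_slopes(sizes, 0.0005, 2000, censor=True, rng=rng)

mean_uncensored = statistics.fmean(uncensored)
mean_censored = statistics.fmean(censored)
print(f"mean slope without censoring: {mean_uncensored:.2e}")
print(f"mean slope with censoring:    {mean_censored:.2e}")
```

Under these assumptions the uncensored slopes average close to zero, while the censored slopes are shifted upward: setting negative estimates to zero raises the expected estimate in proportion to its standard deviation, which here grows with chromosome size. This mimics a positive relationship between h2c and chromosome size even though none exists in the simulated truth.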
