Data from: Introgression underlies phylogenetic uncertainty but not parallel plumage evolution in a recent songbird radiation
Data files
Oct 11, 2023 version files 3.95 GB
-
ASTRAL_4sp_trees.tre
23.75 MB
-
ASTRAL_5sp_trees.trees
26.02 MB
-
README.md
1.66 KB
-
SNAPPER_4sp_snps_autosomes.vcf
26.84 MB
-
SNAPPER_4sp_snps_chrZ.gz
132.43 KB
-
SNAPPER_5sp_snps_autosomes.vcf
18.13 MB
-
SNPs_chr1_topology_weighting.geno
360.22 MB
-
SNPs_chr10_topology_weighting.geno
62.49 MB
-
SNPs_chr11_topology_weighting.geno
65 MB
-
SNPs_chr12_topology_weighting.geno
65.30 MB
-
SNPs_chr13_topology_weighting.geno
57.64 MB
-
SNPs_chr14_topology_weighting.geno
51.54 MB
-
SNPs_chr15_topology_weighting.geno
41.95 MB
-
SNPs_chr17_topology_weighting.geno
33.78 MB
-
SNPs_chr18_topology_weighting.geno
33.19 MB
-
SNPs_chr19_topology_weighting.geno
33.22 MB
-
SNPs_chr1A_topology_weighting.geno
224.27 MB
-
SNPs_chr2_topology_weighting.geno
479.87 MB
-
SNPs_chr20_topology_weighting.geno
45.66 MB
-
SNPs_chr21_topology_weighting.geno
22.07 MB
-
SNPs_chr22_topology_weighting.geno
11.71 MB
-
SNPs_chr23_topology_weighting.geno
18.97 MB
-
SNPs_chr24_topology_weighting.geno
21.81 MB
-
SNPs_chr25_topology_weighting.geno
5.97 MB
-
SNPs_chr26_topology_weighting.geno
18.13 MB
-
SNPs_chr27_topology_weighting.geno
14.08 MB
-
SNPs_chr28_topology_weighting.geno
14.39 MB
-
SNPs_chr29_topology_weighting.geno
4.54 MB
-
SNPs_chr3_topology_weighting.geno
354.10 MB
-
SNPs_chr4_topology_weighting.geno
231.92 MB
-
SNPs_chr4A_topology_weighting.geno
62.49 MB
-
SNPs_chr5_topology_weighting.geno
192.54 MB
-
SNPs_chr6_topology_weighting.geno
115.64 MB
-
SNPs_chr7_topology_weighting.geno
123.18 MB
-
SNPs_chr8_topology_weighting.geno
98.41 MB
-
SNPs_chr9_topology_weighting.geno
82.35 MB
-
SNPs_chrZ_topology_weighting.geno
910.98 MB
Abstract
Instances of parallel phenotypic evolution offer great opportunities to understand the evolutionary processes underlying phenotypic changes. However, confirming parallel phenotypic evolution and studying its causes requires a robust phylogenetic framework. One such example is the “black-and-white wagtails”, a group of five species in the songbird genus Motacilla: one species, the White Wagtail (M. alba), shows wide intra-specific plumage variation, while the four others form two pairs of very similar-looking species (African Pied Wagtail M. aguimp + Mekong Wagtail M. samveasnae and Japanese Wagtail M. grandis + White-browed Wagtail M. maderaspatensis, respectively). However, the two species in each of these pairs were not recovered as sisters in previous phylogenetic inferences. Their relationships varied depending on the markers used, suggesting that gene tree heterogeneity might have hampered accurate phylogenetic inference. Here, we use whole genome resequencing data to explore the phylogenetic relationships within this group, with a special emphasis on characterizing the extent of gene tree heterogeneity and its underlying causes. We first used multispecies coalescent methods to generate a “complete evidence” phylogenetic hypothesis based on genome-wide variants, while accounting for incomplete lineage sorting and introgression. We then investigated the variation in phylogenetic signal across the genome, to quantify the extent of discordance across genomic regions, and test its underlying causes. We found that wagtail genomes are mosaics of regions supporting variable genealogies, because of ILS and inter-specific introgression. The most common topology across the genome, supporting M. alba and M. aguimp as sister species, appears to be influenced by ancient introgression. Additionally, we inferred another ancient introgression event, between M. alba and M. grandis. By combining results from multiple analyses, we propose a phylogenetic network for the black-and-white wagtails that confirms that similar phenotypes evolved in non-sister lineages, supporting parallel plumage evolution. Furthermore, the inferred reticulations do not connect species with similar plumage coloration, suggesting that introgression does not underlie parallel plumage evolution in this group. Our results demonstrate the importance of investigation of genome-wide patterns of gene tree heterogeneity to help understanding the mechanisms underlying phenotypic evolution.
README: Data from: Introgression underlies phylogenetic uncertainty but not parallel plumage evolution in a recent songbird radiation
This data contains genome-wide variants from 29 individuals of six species of wagtails (Motacilla), derived from whole-genome resequencing data. Variants were called based on a White Wagtail (Motacilla alba) reference genome. This data was used to perform a variety of phylogenetic analyses.
Description of the data and file structure
The repository contains three types of data files:
--> ASTRAL_sp_trees.trees: two files containing phylogenetic trees inferred in non-overlapping 50kb sliding windows, with all species (5sp) or a subset of four species (4sp). These files were used to perform species tree inference with ASTRAL.
--> SNAPPER_snps.vcf: three vcf files containing genome-wide variants thinned to 5kb for the autosomes and chromosome Z separately. For the autosomes, two files are provided (5sp and 4sp), the former with all species and the latter with a subset of 4 species. Theses files were used to infer phylogenetic trees with SNAPPER and phylogenetic networks with SNaQ!.
--> SNPs_chr_topology_weighting.geno: 31 variant files (1 per chromosome) containing all genome wide variants for the focal species. These files are in the geno format, which is similar to vcf and generated/used by the genomics general package (https://github.com/simonhmartin/genomics_general). These files were used for topology weighting and calculations of Dxy in sliding windows.
In addition, the file Rancilhac_2023_wagtail_phylogeny_supplements.docx contains the supplementary material for the manuscript.