Intraspecific genomic variation and local adaptation in a young hybrid species
Data files
Version of Nov 30, 2020; files total 77.77 MB
-
A.Mapping.and.Realign_around_Indels.sh
3.96 KB
-
ACC_MF_list.txt
197 B
-
ACQ_MF_list.txt
80 B
-
Alt_Ita.only_maf0.02.bayescenv_fst.txt
298.37 KB
-
Alt_Italy.txt
145 B
-
AncestryProportion-HI-Fst.txt
207.14 KB
-
B.CombineGVCF.and.MergeVCF.sh
3.58 KB
-
beak.Inland.Italians.txt
302 B
-
BH_Ita.only_maf0.02.bayescenv_fst.txt
298.35 KB
-
BH_Italy.txt
143 B
-
BL_Ita.only_maf0.02.bayescenv_fst.txt
297.97 KB
-
BL_Italy.txt
145 B
-
C.Variant_Calling.sh
1.24 KB
-
Combine_SNPs_RecombRate.R
6.22 KB
-
CRO_MF_list.txt
179 B
-
Cuevas.et.al_MolEcol.2020.R
88.13 KB
-
D.ExtractingSNPs_and_ExcludingScaffolds.sh
2.79 KB
-
E.Identify_and_Excluding_missingData_samples.sh
2.04 KB
-
EBO_MF_list.txt
116 B
-
estimate_ancestry.py
4.53 KB
-
F.FilteringSNPs.sh
3.04 KB
-
Fig4A_within.Ita.FST_v.s_HouSpa.FST.txt
512.63 KB
-
Fig4B_SpaIta.FST_v.s_HouIta.FST.txt
543.08 KB
-
Fig4C_Ita.FST_v.s_Hou.FST.txt
676.06 KB
-
Fig4D_Ita.FST_v.s_Spa.FST.txt
233.36 KB
-
FONT_M_list.txt
69 B
-
Fst_Ancestry_chr1_weighted_collapsed.txt
1.31 MB
-
Fst_Ancestry_chr10_weighted_collapsed.txt
255.10 KB
-
Fst_Ancestry_chr11_weighted_collapsed.txt
246.63 KB
-
Fst_Ancestry_chr12_weighted_collapsed.txt
179.38 KB
-
Fst_Ancestry_chr13_weighted_collapsed.txt
116.39 KB
-
Fst_Ancestry_chr14_weighted_collapsed.txt
134.66 KB
-
Fst_Ancestry_chr15_weighted_collapsed.txt
111.81 KB
-
Fst_Ancestry_chr17_weighted_collapsed.txt
24.47 KB
-
Fst_Ancestry_chr18_weighted_collapsed.txt
91.56 KB
-
Fst_Ancestry_chr19_weighted_collapsed.txt
58.26 KB
-
Fst_Ancestry_chr1A_weighted_collapsed.txt
875.70 KB
-
Fst_Ancestry_chr2_weighted_collapsed.txt
1.68 MB
-
Fst_Ancestry_chr20_weighted_collapsed.txt
66.98 KB
-
Fst_Ancestry_chr21_weighted_collapsed.txt
26.38 KB
-
Fst_Ancestry_chr24_weighted_collapsed.txt
17.10 KB
-
Fst_Ancestry_chr26_weighted_collapsed.txt
9.89 KB
-
Fst_Ancestry_chr28_weighted_collapsed.txt
6.22 KB
-
Fst_Ancestry_chr3_weighted_collapsed.txt
1.43 MB
-
Fst_Ancestry_chr4_weighted_collapsed.txt
808.12 KB
-
Fst_Ancestry_chr5_weighted_collapsed.txt
778.11 KB
-
Fst_Ancestry_chr6_weighted_collapsed.txt
424.47 KB
-
Fst_Ancestry_chr7_weighted_collapsed.txt
438.11 KB
-
Fst_Ancestry_chr8_weighted_collapsed.txt
486.18 KB
-
Fst_Ancestry_chr9_weighted_collapsed.txt
181.40 KB
-
FST_House_MF.0.01_global_windowed.weir.fst
489.06 KB
-
fst_recomb_House.from.House_MF.tsv
253.67 KB
-
fst_recomb_Italian.from.Ita.only_MF.tsv
171.19 KB
-
fst_recomb_Spanish.from.Spanish_MF.tsv
51.34 KB
-
FST_Spanish_MF.0.01_global_windowed.weir.fst
125.68 KB
-
FST.House_House_MF.0.01_mod.weir.fst
134.82 KB
-
FST.House.outliers_in.FST.SpaHou.txt
8.73 KB
-
FST.Ita.global_Ita.only_MF.0.02_NORimini_mod.weir.fst
90.67 KB
-
FST.Ita.global.windowed_Ita.only_MF.0.02.windowed.weir.fst
368.33 KB
-
FST.Ita.only_MF.0.02_OutliersValues1.intervals
1.73 KB
-
FST.Italian.outliers_in.FST.SpaHou.txt
9.86 KB
-
FST.Spanish_Spanish_MF.0.01_mod.weir.fst
27.23 KB
-
FST.Spanish.outliers_in.FST.SpaHou.txt
5.45 KB
-
FST.v.s.TEMP.S.txt
1.10 KB
-
G.Transforming_vcf_to_ped_PLINK.sh
925 B
-
H.VCF_readDepth.sh
503 B
-
H.VCF_SiteMeanDepth.sh
446 B
-
H.VCF_Validator.sh
284 B
-
Het_recomb_WGS.txt
1.06 MB
-
Het.AFD_Italian_wgs_All_divergent.loci_afd1_mod.txt
548.86 KB
-
House_maf0.01.bayescan-out_fst.txt
331.56 KB
-
House_maf0.01.coordinates.txt
91.82 KB
-
House_MF_list.txt
636 B
-
House_MF_maf0.01.vcf.gz
3.23 MB
-
House_MF_maf0.01.vcf.idx
64.81 KB
-
House_MF_non-maf-filtering.vcf.gz
4.24 MB
-
House_MF_non-maf-filtering.vcf.gz.csi
39.99 KB
-
House_MF_Norway_list.txt
138 B
-
I.ADMIXTURE.sh
1.39 KB
-
I.AlleleFreq_Heterozygocity.sh
2.37 KB
-
I.FST_statistic.sh
8.24 KB
-
I.IdentifyOutliers.sh
5.97 KB
-
I.Pi.sh
1.50 KB
-
I.TajimasD.sh
2.01 KB
-
ID.Ita.only_MF.maf0.02_copy.txt
4.92 KB
-
ID.Ita.only_MF.maf0.02.txt
4.66 KB
-
ID.Ita.Parent_MF.maf0.02_copy.txt
13.21 KB
-
ID.Ita.Parent_MF.maf0.02.txt
12.06 KB
-
Ita.Coordinates.txt
181 B
-
Ita.only_list.txt
1.06 KB
-
Ita.only_maf0.02.bayescan-out_fst.txt
227.36 KB
-
Ita.only_maf0.02.coordinates.txt
61.94 KB
-
Ita.only_MF_maf0.02_missingdata.imiss
3.79 KB
-
Ita.only_MF_maf0.02_raw.raw
1.30 MB
-
Ita.only_MF_maf0.02.2.Q
2.36 KB
-
Ita.only_MF_maf0.02.map
125.79 KB
-
Ita.Parent_MF_list.txt
2.30 KB
-
Ita.Parent_MF_maf0.02_missingdata.imiss
8.20 KB
-
Ita.Parent_MF_maf0.02_raw.raw
1.73 MB
-
Ita.Parent_MF_maf0.02.2.Q
5.19 KB
-
Ita.Parent_MF_maf0.02.map
79.72 KB
-
Ita.pureParents_population.list.txt
972 B
-
Italian.mainland_Parents_maf0.02.vcf.gz
4.76 MB
-
Italian.mainland_Parents_maf0.02.vcf.gz.tbi
22.68 KB
-
Italian.mainland_Parents_non-variant-included_non-maf-filtering.vcf.gz
17.85 MB
-
Italian.mainland_Parents_non-variant-included_non-maf-filtering.vcf.gz.csi
32.63 KB
-
Italian.only_maf0.02.vcf.gz
3.75 MB
-
Italian.only_maf0.02.vcf.idx
64.54 KB
-
Italian.only_non-maf-filtering.vcf.gz
6.70 MB
-
Italian.only_non-maf-filtering.vcf.gz.csi
35.99 KB
-
LAqu_MF_list.txt
89 B
-
LdB_MF_list.txt
107 B
-
LS_MF_list.txt
154 B
-
MAP_Ita.only_maf0.02.bayescenv_fst.txt
298.55 KB
-
MAP_Italy.txt
141 B
-
MAT_Ita.only_maf0.02.bayescenv_fst.txt
298.46 KB
-
MAT_Italy.txt
141 B
-
Matrix.Ita.NORimini.pairwise.Fst.txt
773 B
-
missing_list_House_MF.txt
14 B
-
missing_list_Ita.only_MF.txt
63 B
-
missing_list_Ita.Parent_MF.txt
115 B
-
missing_list_SpaHou_MF.txt
61 B
-
missing_list_Spanish_MF.txt
47 B
-
MMRR_function.R
2.79 KB
-
only.REF.Parents_maf0.05.vcf.gz
746.96 KB
-
only.REF.Parents_maf0.05.vcf.gz.csi
24.13 KB
-
Parent.Ita.Inland_list.txt
1.33 KB
-
per-gene_pop-analysis.txt
6.30 MB
-
Pi_House_MF_non-variants_non-maf.windowed_mod.pi
336.55 KB
-
Pi_House_MF_non-variants_non-maf.windowed.pi
278.38 KB
-
Pi_Ita.only_MF_NORimini_non-variants_non-maf.windowed_mod.pi
364.23 KB
-
Pi_Ita.only_MF_NORimini_non-variants_non-maf.windowed.pi
301.34 KB
-
Pi_Spanish_MF_non-variants_non-maf.windowed_mod.pi
330.16 KB
-
Pi_Spanish_MF_non-variants_non-maf.windowed.pi
272.74 KB
-
Plot_non-weighted.Ancestry.for.WGS.ipynb
21.20 KB
-
plot_R.function_bayescan.R
4.43 KB
-
population_def_Ita.only.txt
2.27 KB
-
PS_Ita.only_maf0.02.bayescenv_fst.txt
298.84 KB
-
PS_Italy.txt
144 B
-
R.1.Ancestry_per.chr.sh
1.37 KB
-
R.2.Modifying_Ancestry.matrix.sh
1.29 KB
-
R.3.Ancestry.matrix_weighted.by_AlleleFreq.sh
3.76 KB
-
R.4.Ancestry.for.RADs.sh
11.22 KB
-
README.txt
5.99 KB
-
Running_BayeScan.sh
508 B
-
Running_BayeScEnv.sh
687 B
-
SAN_MF_list.txt
125 B
-
SpaHouFST_in.intraspecific.outliers.txt
25.70 KB
-
Spanish_maf0.01.bayescan-out_fst.txt
66.36 KB
-
Spanish_maf0.01.coordinates.txt
18.66 KB
-
Spanish_MF_Kazakhstan_list.txt
86 B
-
Spanish_MF_list.txt
604 B
-
Spanish_MF_maf0.01.vcf.gz
712.41 KB
-
Spanish_MF_maf0.01.vcf.idx
64.03 KB
-
Spanish_MF_non-maf-filtering.vcf.gz
1.09 MB
-
Spanish_MF_non-maf-filtering.vcf.gz.csi
11.29 KB
-
Spanish.House_MF_maf0.01.vcf.gz
2.50 MB
-
Spanish.House_MF_maf0.01.vcf.gz.csi
14.97 KB
-
TajimaD_House_MF.noMAF_mod2.Tajima.D.txt
347.22 KB
-
TajimaD_House_MF.noMAF.txt
318.49 KB
-
TajimaD_Ita.only_MF.noMAF_mod2.Tajima.D.txt
347.76 KB
-
TajimaD_Ita.only_MF.noMAF.txt
316.79 KB
-
TajimaD_Spanish_MF.noMAF_mod2.Tajima.D.txt
319.07 KB
-
TajimaD_Spanish_MF.noMAF.txt
287.54 KB
-
Transforming_VCF-to-BAYESCAN.sh
929 B
-
Transforming_VCF-to-BAYESCENV.sh
931 B
-
TS_Ita.only_maf0.02.bayescenv_fst.txt
297.28 KB
-
TS_Italy.txt
143 B
-
VCF_to_BAYESCAN.spid
1.49 KB
-
VCF_to_BAYESCENV.spid
1.77 KB
Abstract
Hybridization increases genetic variation, hence hybrid species may have greater evolutionary potential once their admixed genomes have stabilized and incompatibilities have been purged. Yet, little is known about how such hybrid lineages evolve at the genomic level following their formation, in particular their adaptive potential. Here we investigate how the Italian sparrow (Passer italiae), a homoploid hybrid species, has evolved and locally adapted to its variable environment. Using restriction site-associated DNA sequencing (RAD-seq) on several populations across the Italian peninsula, we evaluate how genomic constraints and novel genetic variation have influenced population divergence and adaptation. We show that population divergence within this hybrid species has evolved in response to climatic variation, suggesting ongoing local adaptation. As found previously in other non-hybrid species, climatic differences appear to increase population differentiation. We also report strong population divergence in a gene known to affect beak morphology. Most of the strongly divergent loci among Italian sparrow populations do not seem to be differentiated between its parent species, the house and Spanish sparrows. Unlike in the hybrid, population divergence within each of the parental taxa has occurred mostly at loci with high allele frequency difference between the parental species, suggesting that novel combinations of parental alleles in the hybrid have not necessarily enhanced its evolutionary potential. Rather, our study suggests that constraints linked to incompatibilities may have restricted the evolution of this admixed genome, both during and after hybrid species formation.
Methods
Genomic DNA was purified from blood samples. For ddRAD sequencing, the genomic DNA was double-digested with the restriction enzymes EcoRI and MseI. Molecular identifier tags were added by PCR amplification. Library pools were size-selected for fragments of 500-600 bp and then sequenced on an Illumina NextSeq 500 machine using the 1x75 bp sequencing format. Raw reads were filtered and variants were called with a series of genomic tools, including vcftools, plink, bcftools, and GATK, among others.
Here we provide the VCF files and the final data files generated by the statistical analyses, together with the scripts used to analyse the data. These include the scripts for processing the raw reads into the final VCF files, as well as the scripts used for the statistical analyses and the final figures in the study. All files used in the R scripts are also provided.
Usage notes
The README.txt file explains the different categories of files and identifies the files used by the scripts.
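As an illustration of how one of the windowed FST tables might be inspected, the following Python sketch parses a tab-delimited table and ranks windows by weighted FST. It assumes the standard vcftools windowed output header (CHROM, BIN_START, BIN_END, N_VARIANTS, WEIGHTED_FST, MEAN_FST); the deposited files, especially those with "_mod" in their names, may use a different layout, so the header should be checked against README.txt first. The function names and the sample values are hypothetical and are not taken from the dataset or from the authors' scripts.

```python
import csv
import io
import math


def read_windowed_fst(handle):
    """Parse a vcftools-style windowed FST table into a list of dicts.

    Assumes a tab-delimited header containing CHROM, BIN_START, BIN_END,
    N_VARIANTS and WEIGHTED_FST (an assumption about the file layout).
    """
    rows = []
    for rec in csv.DictReader(handle, delimiter="\t"):
        fst = float(rec["WEIGHTED_FST"])
        if math.isnan(fst):
            # Windows with undefined FST are skipped.
            continue
        rows.append({
            "chrom": rec["CHROM"],
            "start": int(rec["BIN_START"]),
            "end": int(rec["BIN_END"]),
            "n_variants": int(rec["N_VARIANTS"]),
            "weighted_fst": fst,
        })
    return rows


def top_windows(rows, n=1):
    """Return the n windows with the highest weighted FST."""
    return sorted(rows, key=lambda r: r["weighted_fst"], reverse=True)[:n]


# Made-up example values, for illustration only (not from the dataset).
SAMPLE = (
    "CHROM\tBIN_START\tBIN_END\tN_VARIANTS\tWEIGHTED_FST\tMEAN_FST\n"
    "chr1\t1\t100000\t12\t0.031\t0.025\n"
    "chr1\t100001\t200000\t8\t0.152\t0.110\n"
    "chr2\t1\t100000\t5\t0.078\t0.060\n"
)

rows = read_windowed_fst(io.StringIO(SAMPLE))
for window in top_windows(rows, n=2):
    print(window["chrom"], window["start"], window["end"], window["weighted_fst"])
```

To apply the same functions to a downloaded file, the in-memory sample can be replaced by an opened file handle, for example `open("FST_House_MF.0.01_global_windowed.weir.fst")`, provided the header matches the assumed columns.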