Pedigree-based assessment of recent population connectivity in a threatened rattlesnake
Data files
Mar 19, 2021 version, files: 61.90 GB
-
BayesianJAGSdispersal.R
1.52 KB
-
grll_pedigree.R
5.48 KB
-
params_RAD117_p1.txt
3.10 KB
-
pedigree_dummy_simulations.R
18.76 KB
-
RAD_117_P1_barcodes.txt
366 B
-
RAD_117_P2_barcodes.txt
240 B
-
RAD_118_P2_barcodes.txt
96 B
-
RAD_118_P2.txt
161 B
-
RAD_118_P3_2_barcodes.txt
208 B
-
RAD_118_P3.txt
380 B
-
RAD_118_P4_barcodes.txt
14 B
-
RAD_119_P4_barcodes.txt
272 B
-
RAD_119_P5_barcodes.txt
318 B
-
RAD_120_P1_barcodes.txt
14 B
-
RAD_120_P5_barcodes.txt
14 B
-
RAD_120_P6_barcodes.txt
382 B
-
RAD117_P1_S1_L001_R1_001.fastq.gz
5.36 GB
-
RAD117_P2_S2_L001_R1_001.fastq.gz
3.89 GB
-
RAD118_P2_S2_L008_R1_001.fastq.gz
2.44 GB
-
RAD118_P3_S3_L008_R1_001.fastq.gz
3.79 GB
-
RAD118_P4_S4_L008_R1_001.fastq.gz
252.48 MB
-
RAD119_P4_S3_L001_R1_001.fastq.gz
3.38 GB
-
RAD119_P5_S4_L001_R1_001.fastq.gz
4.19 GB
-
RAD120_P1_S5_L008_R1_001.fastq.gz
84.10 MB
-
RAD120_P5_S6_L008_R1_001.fastq.gz
200.40 MB
-
RAD120_P6_S7_L008_R1_001.fastq.gz
6.57 GB
-
RAD122_barcodes.txt
206 B
-
RAD122_S1_L001_R1_001.fastq.gz
1.36 GB
-
RAD122_S1_L002_R1_001.fastq.gz
1.31 GB
-
RAD123_barcodes.txt
285 B
-
RAD123_S2_L001_R1_001.fastq.gz
1.86 GB
-
RAD123_S2_L002_R1_001.fastq.gz
1.80 GB
-
RAD124_barcodes.txt
221 B
-
RAD124_S3_L001_R1_001.fastq.gz
1.56 GB
-
RAD124_S3_L002_R1_001.fastq.gz
1.51 GB
-
RAD125_barcodes.txt
322 B
-
RAD125_S4_L001_R1_001.fastq.gz
2.25 GB
-
RAD125_S4_L002_R1_001.fastq.gz
2.18 GB
-
RAD126_barcodes.txt
348 B
-
RAD126_S5_L001_R1_001.fastq.gz
2.44 GB
-
RAD126_S5_L002_R1_001.fastq.gz
2.36 GB
-
RAD133_barcodes.txt
393 B
-
RAD133_S30_L006_R1_001.fastq.gz
7.08 GB
-
RAD137_barcodes.txt
376 B
-
RAD137_S34_L006_R1_001.fastq.gz
6.02 GB
-
ReadMe.txt
1.32 KB
-
Sistrurus_catenatus_NeOHIO_genetics.vcf
436.76 KB
-
Sistrurus_catenatus_NeOHIO_pop_gen_info.csv
1.05 KB
Abstract
Managing endangered species in fragmented landscapes requires estimating dispersal rates between populations over contemporary timescales. Here we develop a new method for quantifying recent dispersal using genetic pedigree data for close and distant kin. Specifically, we describe an approach that infers missing shared ancestors between pairs of kin in habitat patches across a fragmented landscape. We then apply a stepping-stone model to assign unsampled individuals in the pedigree to probable locations based on minimizing the number of movements required to produce the observed locations in sampled kin pairs. Finally, we use all pairs of reconstructed parent-offspring sets to estimate dispersal rates between habitat patches under a Bayesian model. Our approach measures connectivity over the timescale represented by the small number of generations contained within the pedigree and so is appropriate for estimating the impacts of recent habitat changes due to human activity. We used our method to estimate recent movement between newly discovered populations of threatened Eastern Massasauga Rattlesnakes (Sistrurus catenatus) using data from 2996 RAD-based genetic loci. Our pedigree analyses found no evidence for contemporary connectivity between five genetic groups, but, as validation of our approach, showed high dispersal rates between sample sites within a single genetic cluster. We conclude that these five genetic clusters of Eastern Massasauga Rattlesnakes have small numbers of resident snakes and are demographically isolated conservation units. More broadly, our methodology can be widely applied to determine contemporary connectivity rates, independent of bias from shared genetic similarity due to ancestry that impacts other approaches.
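The final step described above, estimating a dispersal rate from reconstructed parent-offspring sets, can be illustrated with a deliberately simplified sketch. This is not the JAGS model distributed with the dataset: it assumes a single pooled dispersal probability, a Beta(1, 1) prior, and a hypothetical input format of (parent patch, offspring patch) tuples, all of which are illustrative choices rather than details taken from the study.

```python
def dispersal_posterior(pairs, prior_a=1.0, prior_b=1.0):
    """Beta-binomial posterior for the probability that an offspring
    is found in a different patch from its parent.

    pairs: list of (parent_patch, offspring_patch) labels (hypothetical format).
    prior_a, prior_b: Beta prior parameters (assumed uniform by default).
    Returns (moved, n, post_a, post_b, posterior_mean).
    """
    n = len(pairs)
    # A "movement" is any parent-offspring pair assigned to different patches.
    moved = sum(1 for parent, child in pairs if parent != child)
    post_a = prior_a + moved
    post_b = prior_b + (n - moved)
    posterior_mean = post_a / (post_a + post_b)
    return moved, n, post_a, post_b, posterior_mean


# Toy data: four parent-offspring pairs, one of which spans two patches.
toy_pairs = [("A", "A"), ("A", "B"), ("B", "B"), ("A", "A")]
moved, n, post_a, post_b, post_mean = dispersal_posterior(toy_pairs)
print(f"{moved}/{n} pairs moved; posterior Beta({post_a}, {post_b}), mean {post_mean:.3f}")
```

A per-patch-pair version would replace the single Beta posterior with one posterior per origin patch (for example a Dirichlet over destination patches), which is closer in spirit to estimating rates between specific habitat patches.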
Methods
The dataset consists of ddRADseq data from 86 individual Sistrurus catenatus (Eastern Massasauga Rattlesnake). Libraries were prepared with the restriction enzymes EcoRI and PstI, size selected to 300-600 bp, and sequenced as single-end reads on Illumina HiSeq 2500 and HiSeq 4000 platforms. Reads were aligned against a reference genome using ipyrad, and loci were filtered in PLINK 2.0, retaining 2996 loci that were called in every individual.
Usage notes
The data should contain no missing values. All loci are bi-allelic. Anonymized survey sites linked to sample names are provided in a separate document. Because these sites are sensitive, no GPS coordinates or morphometric data are reported. All S. catenatus in this dataset came from Ashtabula County, Ohio, USA. R scripts have been uploaded as examples to follow, but they cannot be run as provided because the location and morphology data have been removed.
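The two properties stated above (no missing genotypes, bi-allelic loci) can be checked directly against a VCF file. The following is a minimal sketch that relies only on the standard VCF text layout (tab-separated columns, nine fixed columns before the samples, GT as the first FORMAT subfield); the function name `check_vcf` and the toy records are illustrative and are not part of the deposited scripts.

```python
import io


def check_vcf(handle):
    """Count samples and loci in a VCF and flag loci that are not
    bi-allelic or that contain a missing genotype.

    Returns (n_samples, n_loci, problems), where problems is a list of
    (chrom, pos, reason) tuples.
    """
    n_samples = None
    n_loci = 0
    problems = []
    for line in handle:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            n_samples = len(fields) - 9  # nine fixed columns precede samples
            continue
        n_loci += 1
        chrom, pos, alt = fields[0], fields[1], fields[4]
        # Bi-allelic means exactly one ALT allele (no comma, not ".").
        if alt == "." or "," in alt:
            problems.append((chrom, pos, "not bi-allelic"))
        # GT is the first colon-separated subfield; "." marks a missing allele.
        for sample in fields[9:]:
            if "." in sample.split(":")[0]:
                problems.append((chrom, pos, "missing genotype"))
                break
    return n_samples, n_loci, problems


# Toy VCF with two samples and three loci (made-up records).
header = "#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT s1 s2".split()
rows = [
    "loc1 10 . A G . PASS . GT 0/1 1/1".split(),
    "loc2 22 . C T,G . PASS . GT 0/1 0/2".split(),
    "loc3 5 . G A . PASS . GT ./. 0/0".split(),
]
demo = "##fileformat=VCFv4.2\n" + "\n".join("\t".join(r) for r in [header] + rows) + "\n"
n_samples, n_loci, problems = check_vcf(io.StringIO(demo))
print(n_samples, n_loci, problems)
```

To run the same check on the deposited file, pass `open("Sistrurus_catenatus_NeOHIO_genetics.vcf")` instead of the toy handle. If the file matches the description above, the expected result is 86 samples, 2996 loci, and an empty problem list.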