Data from: A shift to metapopulation genetic management for persistence of a species threatened by fragmentation: the case of an endangered Australian freshwater fish

Pavlova, Alexandra 1 ; Tonkin, Zeb2; Pearce, Luke3; Robledo-Ruiz, Diana 1 ; Lintermans, Mark4; Ingram, Brett5; Lyon, Jarod2; Beitzel, Matt6; Broadhurst, Ben7; Rourke, Meaghan8; Sturgiss, Felicity9; Lake, Erin3; Castrejón-Figueroa, Jesús10; Stocks, Jerom3; Sunnucks, Paul 1

Research facility: Australian Research Council

Published Jul 02, 2025 on Dryad. https://doi.org/10.5061/dryad.6q573n69j

Data files

Jul 02, 2025 version files 40 GB

id_pop.txt

10.12 KB
Pavlova-etal-2025_bamfiles.tar.gz

40 GB
README.md

2.53 KB

Abstract

In a world where habitats are degrading and the climate is warming and becoming unpredictable, biodiversity conservation efforts and funding remain grossly inadequate. As part of a multi-faceted approach to halting extinctions, shifting from preserving small, remnant populations to a practice of genetically connecting populations that recreate larger and more diverse populations is expected to be beneficial. This harnesses key evolutionary processes to promote species’ abilities to adapt to changing environments and to increase the likelihood of population persistence. We use the endangered Macquarie perch (Macquaria australasica) as a case study to develop a genetic strategy for metapopulation management aimed at promoting population growth and persistence. The Macquarie perch’s range has been highly fragmented, and its remaining habitat is at risk of catastrophic degradation due to climate change. We integrate results of new and existing genetic analyses to illustrate how genetically depauperate populations can benefit from admixture, and how the outcomes of management interventions can be quantified through genetic monitoring. We develop the pipeline JeDi (https://github.com/drobledoruiz/JeDi) for estimating unbiased individual heterozygosity, population nucleotide diversity,y and pairwise population divergence, using reduced-representation genome sequencing data and an assembled reference genome. We use this pipeline to estimate baseline data for monitoring of Macquarie perch populations and show that combining two genetic sources of migrants during population restoration resulted in doubling of nucleotide diversity compared to either source. Genetic diversity estimates using our pipeline are comparable across studies, datasets and species, and suitable for evaluating the rate of global biodiversity change.

Dataset DOI: 10.5061/dryad.6q573n69j

Description of the data and file structure

This repository contains genomic data for the manuscript:

Pavlova, A., Tonkin, Z., Pearce, L., Robledo-Ruiz, D., Lintermans, M., Ingram, B., Lyon, J., Beitzel, M., Broadhurst, B., Rourke, M., Sturgiss, F., Lake, E., Castrejón-Figueroa, J., Stocks, J., & Sunnucks, P. (2025). A shift to metapopulation genetic management for persistence of a species threatened by fragmentation: the case of an endangered Australian freshwater fish. Molecular Ecology. DOI:10.1111/mec.70005, accepted 5/6/2025.

The two files in this repository could be used as input for JeDi pipeline described in the manuscript. The JeDi pipeline estimates unbiased individual heterozygosity, population nucleotide diversity and pairwise population divergence, using reduced-representation genome sequencing data (BAM files) and an assembled reference genome (available from DNA Zoo website).

Files and variables

File: Pavlova-etal-2025_bamfiles.tar.gz

Description: This compressed file is an archive of 719 BAM files for 719 Macquarie perch Macquaria australasica individuals sampled from 20 populations across the species' range. Use tar with gzip option to open the archive. Each BAM file contains single-read sequencing data from reduced-representation genome sequencing (DArTseq) mapped onto the chromosomal-length genome assembly DU_Maus_v1.0_HiC.fasta.gz.

File: id_pop.txt

Description: This file contains no headers and includes two columns, individual identifier and population identifier (assigning each of the 719 individuals to a population).

Variables

individual ID
population ID

Code/software

Raw sequencing reads provided by DArT were pre-processed: Illumina adaptors were trimmed with fastp (Chen et al. 2018), and demultiplexed with process_radtags of STACKS package (Catchen et al. 2013). The processed reads were mapped to the chromosome-length Macquarie perch genome using BWA-MEM (Li and Durbin 2009) with parameter -R set to '@RG\tID:sample_name\tSM:sample_name'. Mapped reads per individual were sorted, filtered (min MAPQ ≥ 20) and transformed to BAM with samtools (Li et al. 2009).