Data from: Regionally localized population genetic structure of a non-native terrestrial isopod (Porcellio spinicornis) (Isopoda: Porcellionidae)
Data files
May 20, 2026 version files 128.47 MB
-
Dataset1_SNP_tree.treefile
3.02 KB
-
Dataset1.phy
2.84 MB
-
Dataset1.vcf
96.94 MB
-
Dataset2.vcf
28.64 MB
-
P_spinicornis_COI.fa
40.81 KB
-
README.md
2.11 KB
Abstract
In the Prairie Provinces of Canada (Alberta, Saskatchewan and Manitoba), all sowbugs (terrestrial isopods) (Isopoda: Oniscidea) are non-native, but little is known of their origins and subsequent movements. We investigated the population structure of the sowbug Porcellio spinicornis Say in Alberta using genomic markers (ddRADseq) and mitochondrial COI barcodes. Populations of P. spinicornis in Edmonton, Calgary, and southeastern Alberta are genetically dissimilar and likely originated from at least three separate introduction events. Edmonton populations display genetic homogeneity consistent with anecdotal reports that sowbugs in Alberta arrived recently. Evidence for greenhouses as initial introduction sites was inconclusive, with at least one greenhouse population in Edmonton potentially originating from an adjacent outdoor site rather than serving as a source population itself. COI diversity in Alberta populations of P. spinicornis is low and consists entirely of haplotypes that are currently only reported from North America, suggesting that they may have originated from other North American populations rather than from native populations in Europe. Here, we provide the data matrices and tree files used in our analyses.
Data matrices and tree file for analyses (phylogenetic, ADMIXTURE, PCA, DAPC, kinship, etc.) of SNP data from 71 Porcellio spinicornis from Alberta, Canada. Data matrix for phylogenetic analysis of COI data from 55 Porcellio spinicornis from North America and Europe.
Description of the data and file structure
P_spinicornis_COI.fa: COI sequences from 55 Porcellio spinicornis from North America and Europe, aligned and trimmed for haplotype network construction and phylogenetic analysis. Sequences with "BOLD" in the name (N=24) are from the BOLD database (https://boldsystems.org/); sequence number 24 is from GenBank (https://www.ncbi.nlm.nih.gov/genbank/). The remaining 30 sequences are from our study.
Dataset1.vcf: Data matrix of 39,978 SNPs (30,706 informative SNPs) for 71 Porcellio spinicornis individuals. Filtered to <20% missing data per locus, minimum genotype quality of 30, minimum allele frequency 0.01. This dataset was used for all analyses except ADMIXTURE.
Dataset1.phy: Phylip version of the Dataset1.vcf file. This file was used for phylogenetic analysis. Please note that all sites (including non-informative sites) are included in this file; only 30,706 are informative.
Dataset1_SNP_tree.treefile: Maximum likelihood tree built from Dataset 1, with branch support values (1000 iterations of Ultrafast Bootstrap) in percentages. This tree was produced from Dataset1.phy using IQTree version 2.2.2.7 and the built-in ModelFinder which selected SYM+R4. The ascertainment bias correction option (+ASC) was applied.
Dataset2.vcf: Data matrix of 11,800 SNPs for 71 Porcellio spinicornis individuals. This dataset includes only one SNP per locus and was filtered to <20% missing data per locus, minimum genotype quality of 30, minimum allele frequency 0.01. This dataset was used only for the ADMIXTURE and kinship analyses.
