Isolation in motion: how riverscape bridges and barriers shape the evolution of plants in fast-flowing river archipelagos
Data files
May 14, 2026 version files 106.11 MB
-
marathrum4diversity_stats.recode.vcf.zip
30.31 MB
-
Nuclear_SNPs.zip
37.49 MB
-
Plastome_SNPs.zip
133.30 KB
-
README.md
2.89 KB
-
TE_from_WGS.zip
69.85 KB
-
WGS_data_analysis.zip
38.11 MB
Abstract
Islands serve as natural laboratories for studying evolution. In fast-flowing rivers, rapids and waterfalls are discrete rocky habitats, acting as insular freshwater landscape units. In the Neotropics, these riverine archipelagos are inhabited one group of angiosperms, the Podostemaceae. We used Marathrum foeniculaceum, a species distributed across the Americas, to test ecological hypotheses derived from island biogeography. Using whole‐genome and plastome data from individuals collected across rivers in Panama and Colombia, we investigated the extent and constraints to gene flow at multiple spatial scales using tests of gene flow under the multispecies coalescent with migration model. We further tested the hypothesis that gene flow is asymmetric and follows the direction of river flow. Our results revealed strong genetic differentiation among independent drainage basins and significant isolation-by-distance, with overland distance posing a stronger barrier to gene flow than distance along continuous river courses. Gene flow identified between adjacent rivers likely reflects historical or episodic river connections rather than frequent overland dispersals. Comparisons between nuclear and plastid data indicate stronger constraints on seed than pollen dispersal. Phylogeographic patterns in Podostemaceae further suggest an east-to-west history of river connectivity across the Isthmus of Panama. Together, these findings show that river connectivity and flow directionality structure genetic variation in strictly riverine aquatic angiosperms. Rapids and waterfalls act as evolutionary islands where dispersal limitation, localized connectivity, and water flow interact to shape population differentiation in riverine landscapes.
Dataset DOI: 10.5061/dryad.hx3ffbgss
Description of the data and file structure
We have submitted our processed data and files for analyses complementing scripts and files in Github https://github.com/Bedoya-Research-Group/River_phylogeography/. The data submitted includes variant calls for inferences of structure and gene flow across space and target reference file for locus recovery from genome skimming as detailed below.
Files and variables
File: marathrum4diversity_stats.recode.vcf.zip
Description: VCF (Variant Call Format) file with SNP (Single Nucleotide Polymorphism) data for 74 individuals of Marathrum from Panama. SNPs were called by mapping short reads to a reference genome in Podostemaceae. Filtering of SNPs included indel removal, quality filtering, removal of singletons (<0.05), and missing data (>5%). One SNP per 1000 base pairs was selected to reduce linkage. This file was used for estimation of genetic diversity stats (to identify gene flow within rivers) in R (see code in Github)
File: Nuclear_SNPs.zip
Description: Zip file including 1) VCF file used for phylogenetic inference with nuclear data of Marathrum population from Panama and Colombia. 2) Alignment resulting from conversion of VCF to Phylip file and removal of invariant sites (from IUPAC ambiguity codes). 3) Maximum-likelihood phylogenetic tree inferred from the phylip aligment with iqtree.
File: Plastome_SNPs.zip
Description: Zip file including 1) text file with list of individuals that were removed from plastome analysis (samples from Colombia). 2) VCF file used for phylogenetic inference with plastome data of Marathrum population from Panama and Colombia (marathrum_for_tree_inference_plastome.vcf). 3) Alignment resulting from conversion of VCF to Phylip file and removal of invariant sites (from IUPAC ambiguity codes). 4) Maximum-likelihood phylogenetic tree inferred from the phylip aligment with iqtree. 5) VCF file used for population genetics analyses (i.e., excluding M. utile which was only used for testing the monophyly of M. foeniculaceum)
File: TE_from_WGS.zip
Description: Target reference file for locus recovery from genome-skimming data for MSC-M analyses.
File: WGS_data_analysis.zip
Description: VCF file (all_samples_filtered_final.recode.vcf) and name lists (text files) for creating subsets for population genetics analyses of nuclear data.
Access information
Other publicly accessible locations of the data:
