Data from: Caves as species pumps: Key innovations, isolation, and periodic introgression drive the world's largest cavefish radiation in a dynamic karstic landscape
Data files
Apr 30, 2026 version files 477.31 MB
-
ipyrad_GATK_vcfs.7z
477.30 MB
-
README.md
3.60 KB
Abstract
Species diversification arises from complex interactions among multiple drivers, such as gene flow, hybridization, key innovations, historical climate changes, geological events, and ecological opportunities. Yet, their relative contributions to large radiations remain inadequately understood. We investigate the interplay among these factors in the diversification of Sinocyclocheilus, a diverse cavefish radiation comprising 79 species. This genus spans a continuum from surface-dwelling forms with fully developed eyes and pigmentation to cave-dwelling forms with regressed eyes, pigment loss, and unique traits such as horns and a dorsal humps. Using reduced-representation genomic data (RADseq), we detect widespread gene flow across different species, with introgression playing a major role compared to incomplete lineage sorting in generating phylogenetic discordance and contributing genetic variation for cave adaptation and diversification in this group. Key traits, including eye degeneration, reduced pigmentation, and horn development, evolved independently multiple times as adaptations to cave environments. Furthermore, geological and climatic shift events, such as the uplift of the Tibetan plateau and the late Miocene cooling, significantly enhanced their speciation rates. Demographic analyses indicate population expansions during the Gonghe Movement and stability during the Last Glacial Maximum, possibly due to the buffering of cave refugia. Periodic introgression events promoted by isolation and reconnections due to the changing climate and geological activity, combined with the repeated evolution of key cave-adapted traits, emerge as primary drivers of this radiation. Our findings underscore the intricate interactions of these drivers in Sinocyclocheilus evolution, offering fresh insights into the processes driving cave adaptation and diversification.
Title of Dataset
Caves as species pumps: key innovations, isolation, and periodic introgression drive the world's largest cavefish radiation in a dynamic karstic landscape
Description of the data and file structure
- Description of dataset
These VCF files were generated using both the ipyrad and GATK pipelines to investigate the diversification of Sinocyclocheilus. Input files for downstream analyses can be readily generated from these VCF files using the corresponding bioinformatic tools described in the Methods section of the associated manuscript. - File list
ipyrad/All_individuals_ipyrad.vcf (IQ-TREE, SVDquartets, ASTRAL, Dsuite, Treemix, MSCquartets, SplitsTree)
ipyrad/SNAPP_time_tree_ipyrad.vcf (Downstream analysis: SNAPP, LTT, RPANDA, HISSE, ASR)
ipyrad/cladeA_ipyrad.vcf (Downstream analyses: Treemix, PhyloNetworks)
ipyrad/cladeB_ipyrad.vcf (Downstream analyses: Treemix, PhyloNetworks)
ipyrad/cladeC_ipyrad.vcf (Downstream analyses: Treemix, PhyloNetworks)
ipyrad/cladeD_ipyrad.vcf (Downstream analyses: Treemix, PhyloNetworks, STRUCTURE)
ipyrad/cladeE_ipyrad.vcf (Downstream analyses: Treemix, PhyloNetworks)
ipyrad/cladeF_ipyrad.vcf (Downstream analyses: Treemix, PhyloNetworks)
ipyrad/altishoulderus_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/cyphotergous_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/donglanensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/guanyangensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/guilinensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/huanjiangensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/lingyunensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/longibarbatus_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/mashanensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/microphthalmus_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/punctatus_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/qiubeiensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/tianlinensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/xunlensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
ipyrad/yishanensis_ipyrad.vcf (Downstream analysis: Stairway Plot 2)
GATK/All_individuals_GATK.vcf.gz (IQ-TREE, SVDquartets, ASTRAL, Dsuite, Treemix, MSCquartets, SplitsTree)
GATK/SNAPP_time_tree_GATK.vcf (Downstream analysis: SNAPP)
GATK/altishoulderus_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/cyphotergous_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/donglanensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/guanyangensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/guilinensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/huanjiangensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/lingyunensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/longibarbatus_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/mashanensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/microphthalmus_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/punctatus_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/qiubeiensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/tianlinensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/xunlensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
GATK/yishanensis_GATK.vcf (Downstream analysis: Stairway Plot 2)
All files have been deposited inipyrad_GATK_vcfs.7z.
