Data from: Hybridization, maternal inheritance, and evolutionary divergence in Hypancistrus (Siluriformes: Loricariidae) from the Xingu River
Data files
Jun 01, 2026 version files 791.91 KB
-
13genes.treefile
1.15 KB
-
alignment_13genes.fa
209.53 KB
-
alignment_nd2_cytb_cox1.fa
68.81 KB
-
hypancistrus.nex
230.30 KB
-
hypancistrus.xml
248.72 KB
-
hypancistrus5-MCC.tre
27.04 KB
-
nd2_cytb_cox1.treefile
987 B
-
README.md
5.37 KB
Abstract
Catfishes of the genus Hypancistrus are among the most emblematic endemic species of the Xingu River rapids, a biodiversity hotspot of the Amazon Basin. In the unique Volta Grande region, the distributions of Hypancistrus zebra, Hypancistrus seideli, and Hypancistrus yudja overlap, creating a potential zone of biological interaction, and previous morphological analyses suggest hybridization between H. seideli and H. yudja. We investigated maternal inheritance and phylogenetic relationships among the three species and hybrids using complete mitochondrial genomes and nuclear genome size estimates. Phylogenetic analyses revealed that hybrids carried maternal lineages exclusively from H. zebra and H. seideli, with no evidence of contribution from H. yudja. Nuclear genome size analyses support this pattern, with hybrids exhibiting intermediate values consistent with additive inheritance. Divergence time estimates indicated a very recent evolutionary radiation (< 0.5 Ma), explaining the persistence of reproductive compatibility among the species. These findings raise conservation concerns, particularly for the zebra pleco H. zebra, a critically endangered species at risk of genetic introgression. Our study provides the first complete mitochondrial genome data for these species. It emphasizes the importance of integrating morphological and genomic approaches to understand hybridization dynamics and guide conservation strategies in the Xingu River.
Dataset DOI: 10.5061/dryad.7pvmcvf6p
Description of the data and file structure
This dataset includes mitochondrial sequence alignments, maximum likelihood phylogenies, and Bayesian molecular dating results used to investigate hybridization, maternal inheritance, and evolutionary divergence among Hypancistrus species from the Xingu River.
The dataset enables replication of the results presented in the manuscript and allows further comparative phylogenetic and molecular clock studies.
Repository Content
- Alignments (FASTA and NEXUS formats)
- Maximum Likelihood trees from IQ-TREE analyses (NEWICK format)
- Bayesian divergence dating analyses from BEAST (input XML + MCC tree output)
File Descriptions
Alignment Files
- alignment_13genes.fa — Concatenated alignment of 13 mitochondrial protein-coding genes (11,416 bp).
- alignment_nd2_cytb_cox1.fa — Concatenated alignment of ND2, CytB, and COX1 genes.
- hypancistrus.nex — NEXUS file for BEAST analysis, including sequence data and partition schemes.
Maximum Likelihood Trees (IQ-TREE outputs)
- 13genes.treefile — ML tree from the 13-gene dataset (NEWICK format).
- nd2_cytb_cox1.treefile — ML tree from ND2, CytB, COX1 dataset (NEWICK format).
Bayesian Dating Analysis (BEAST outputs)
- hypancistrus.xml — BEAST input file with models, priors, and MCMC settings.
- hypancistrus5-MCC.tre — Maximum Clade Credibility (MCC) tree (NEXUS format).
Definitions and Abbreviations
- ND2 = NADH dehydrogenase subunit 2
- CytB = Cytochrome b
- COX1 = Cytochrome oxidase subunit 1
- MCC tree = Maximum Clade Credibility tree
- No special codes for missing data were used.
Software and Tools Used
- IQ-TREE v2.2.0 — Maximum Likelihood phylogenetic analyses.
- BEAST v2.7 — Bayesian divergence dating analyses.
- FigTree v1.4.4 and iTOL — Visualization and annotation of trees.
Important Notes
- IQ-TREE outputs are in NEWICK format; BEAST MCC tree is in NEXUS format.
- Data can be reused for independent phylogenetic or molecular dating analyses.
Article Citation: Kerniske, F. F., Melo, B. F., Sousa, L. M., Degrandi, T. M., & Artoni, R. F. (2026). Hybridization, Maternal Inheritance, and Evolutionary Time of Divergence of Endangered Species of Hypancistrus (Siluriformes: Loricariidae) in the Xingu River. Ecology and Evolution, 16(5), e73624. https://doi.org/10.1002/ece3.73624
Files and variables
File: 13genes.treefile
Description: Maximum Likelihood (ML) phylogenetic tree inferred with IQ-TREE from the concatenated dataset of 13 mitochondrial protein-coding genes. Output in NEWICK format, includes branch lengths and support values.
File: alignment_13genes.fa
Description: Concatenated nucleotide alignment of 13 mitochondrial protein-coding genes (11,416 bp total). Format: FASTA. Used as input for ML and Bayesian analyses.
File: alignment_nd2_cytb_cox1.fa
Description: Concatenated nucleotide alignment of ND2, CytB, and COX1 genes. Format: FASTA. Used for complementary ML phylogenetic analyses.
File: hypancistrus5-MCC.tre
Description: Maximum Clade Credibility (MCC) tree summarizing posterior tree distribution from BEAST Bayesian dating analysis. Format: NEXUS. Includes divergence time estimates.
File: hypancistrus.nex
Description: Alignment file in NEXUS format containing sequence data and partition information. Used as input for BEAST analysis.
File: nd2_cytb_cox1.treefile
Description: Maximum Likelihood phylogenetic tree inferred with IQ-TREE from concatenated ND2, CytB, and COX1 dataset. Format: NEWICK.
File: hypancistrus.xml
Description: Input file for BEAST v2, specifying sequence partitions, substitution models, priors, and MCMC settings used in Bayesian divergence dating analysis.
Code/software
- IQ-TREE v2.2.0
- Used for Maximum Likelihood phylogenetic analyses.
- Input: FASTA alignment files (
*.fa). - Output: tree files (
*.treefile) in NEWICK format. - Website: http://www.iqtree.org
- BEAST v2.7 (Bayesian Evolutionary Analysis Sampling Trees)
- Used for divergence time estimation and Bayesian phylogenetic inference.
- Input: NEXUS alignment (
hypancistrus.nex) and XML configuration file (hypancistrus.xml). - Output: posterior tree distribution and Maximum Clade Credibility tree (
hypancistrus5-MCC.tre). - Website: https://beast2.org
- FigTree v1.4.4
- Used for tree visualization.
- Input: tree files in NEWICK and NEXUS formats.
- Website: http://tree.bio.ed.ac.uk/software/figtree/
- iTOL (Interactive Tree of Life)
- Online tool for visualization and annotation of phylogenetic trees.
- Input: tree files in NEWICK and NEXUS formats.
- Website: https://itol.embl.de
Access information
Other publicly accessible locations of the data:
- None
Data was derived from the following sources:
- None
