Data for dating in the dark: Elevated substitution rates in cave cockroaches (Blattodea: Nocticolidae) have negative impacts on molecular date estimates
Data files
Jul 26, 2023 version files 26.90 MB
-
16S_alignment.Renamed.nex
74.55 KB
-
ASTRAL_UCE60p_10pcon.tre
2.69 KB
-
ASTRAL_UCE60p.tre
2.70 KB
-
BEAST_mtPCG_FLC_comb_burn10_MCC_mean.tre
125.08 KB
-
BEAST_mtPCG_NoNoct_ranloc_NoNoct_comb_burn50_33.tre
99.07 KB
-
BEAST_mtPCG_NoNoct_rel_exp_NoNoct_comb_burn10_MCC_mean.tre
99.17 KB
-
BEAST_mtPCG_NoNoct_rel_ln_NoNoct_comb_burn10_MCC_mean.tre
99.07 KB
-
BEAST_mtPCG_NoNoct_strict_NoNoct_comb_burn10_MCC_mean.tre
69.30 KB
-
BEAST_mtPCG_ranloc_comb_burn40_MCC_mean.tre
125.05 KB
-
BEAST_mtPCG_rel_exp_comb_burn20_MCC_mean.tre
125.16 KB
-
BEAST_mtPCG_rel_ln_comb_burn20_MCC_mean.tre
125.01 KB
-
BEAST_mtPCG_strict_comb_burn10_MCC_mean.tre
87.28 KB
-
ExaBayes_mtPCG_NoNoct_run0_burn10_mean_MCC.tre
73.26 KB
-
ExaBayes_mtPCG_NoNoct_run1_burn10_mean_MCC.tre
73.23 KB
-
ExaBayes_mtPCG_NoNoct_run2_burn10_mean_MCC.tre
73.21 KB
-
ExaBayes_mtPCG_NoNoct_run3_burn10_mean_MCC.tre
73.26 KB
-
ExaBayes_mtPCG_run0_burn10_mean_MCC.tre
92.78 KB
-
ExaBayes_mtPCG_run1_burn10_mean_MCC.tre
92.78 KB
-
ExaBayes_mtPCG_run2_burn10_mean_MCC.tre
92.66 KB
-
ExaBayes_mtPCG_run3_burn10_mean_MCC.tre
92.77 KB
-
ExaBayes_UCE_NoNoct_run0_unpart_burn10_mean_MCC.Renamed.tre
30.72 KB
-
ExaBayes_UCE_NoNoct_run1_unpart_burn10_mean_MCC.Renamed.tre
30.74 KB
-
ExaBayes_UCE_NoNoct_run2_unpart_burn10_mean_MCC.Renamed.tre
30.73 KB
-
ExaBayes_UCE_NoNoct_run3_unpart_burn10_mean_MCC.Renamed.tre
30.74 KB
-
ExaBayes_UCE_run0_unpart_burn10_mean_MCC.tre
48.56 KB
-
ExaBayes_UCE_run1_unpart_burn10_mean_MCC.tre
48.57 KB
-
ExaBayes_UCE_run2_unpart_burn10_mean_MCC.tre
48.56 KB
-
ExaBayes_UCE_run3_unpart_burn10_mean_MCC.tre
48.55 KB
-
IQTREE_16S.tre
8.33 KB
-
IQTREE_mtPCG.tre
8.18 KB
-
IQTREE_part_al4_UCE_60p_1_NoNoct.Renamed.tre
2.03 KB
-
IQTREE_PCG4_named_2.fasta.c2_NoNoct.fas.tre
6.10 KB
-
IQTREE_UCE60p_mito_16S.tre
11.22 KB
-
IQTREE_UCE60p.tre
3.43 KB
-
MCMCtree_mtPCG_AC1.tre
9.87 KB
-
MCMCtree_mtPCG_AC2.tre
9.86 KB
-
MCMCtree_mtPCG_AC3.tre
9.74 KB
-
MCMCtree_mtPCG_ILN1.tre
9.88 KB
-
MCMCtree_mtPCG_ILN2.tre
9.88 KB
-
MCMCtree_mtPCG_ILN3.tre
9.88 KB
-
MCMCtree_mtPCG_NoNoct_auto1.tre
7.80 KB
-
MCMCtree_mtPCG_NoNoct_auto2.tre
7.80 KB
-
MCMCtree_mtPCG_NoNoct_auto3.tre
7.80 KB
-
MCMCtree_mtPCG_NoNoct_uncor1.tre
7.81 KB
-
MCMCtree_mtPCG_NoNoct_uncor2.tre
7.81 KB
-
MCMCtree_mtPCG_NoNoct_uncor3.tre
7.82 KB
-
MCMCtree_UCE_NoNoct_auto1.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_auto2.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_auto3.Renamed.tre
2.65 KB
-
MCMCtree_UCE_NoNoct_uncor1.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_uncor2.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_uncor3.Renamed.tre
2.66 KB
-
mtPCG_alignment.Renamed.nex
1.27 MB
-
mtPCG_partition.nex
459 B
-
PhyloMAd_Noct.csv
1.42 KB
-
PhyloMAd_NoNoct.csv
1.65 KB
-
README.md
1.37 KB
-
UCE60p_alignment.Renamed.nex
23.38 MB
-
UCE60p_partition.nex
71.20 KB
Dec 10, 2023 version files 155.03 MB
-
16S_alignment.Renamed.Renamed.nex
74.53 KB
-
ASTRAL_UCE60p_10pcon.Renamed.tre
2.69 KB
-
ASTRAL_UCE60p.Renamed.tre
2.69 KB
-
BEAST_mtPCG_FLC_comb_burn10_MCC_mean.Renamed.tre
125.24 KB
-
BEAST_mtPCG_NoNoct_ranloc_NoNoct_comb_burn50_33.Renamed.tre
99.24 KB
-
BEAST_mtPCG_NoNoct_rel_exp_NoNoct_comb_burn10_MCC_mean.Renamed.tre
99.34 KB
-
BEAST_mtPCG_NoNoct_rel_ln_NoNoct_comb_burn10_MCC_mean.Renamed.tre
99.24 KB
-
BEAST_mtPCG_NoNoct_strict_NoNoct_comb_burn10_MCC_mean.Renamed.tre
69.46 KB
-
BEAST_mtPCG_ranloc_comb_burn40_MCC_mean.Renamed.tre
125.22 KB
-
BEAST_mtPCG_rel_exp_comb_burn20_MCC_mean.Renamed.tre
125.33 KB
-
BEAST_mtPCG_rel_ln_comb_burn20_MCC_mean.Renamed.tre
125.17 KB
-
BEAST_mtPCG_strict_comb_burn10_MCC_mean.Renamed.tre
87.45 KB
-
ExaBayes_mtPCG_NoNoct_run0_burn10_mean_MCC.Renamed.tre
73.43 KB
-
ExaBayes_mtPCG_NoNoct_run1_burn10_mean_MCC.Renamed.tre
73.40 KB
-
ExaBayes_mtPCG_NoNoct_run2_burn10_mean_MCC.Renamed.tre
73.38 KB
-
ExaBayes_mtPCG_NoNoct_run3_burn10_mean_MCC.Renamed.tre
73.43 KB
-
ExaBayes_mtPCG_run0_burn10_mean_MCC.Renamed.tre
92.96 KB
-
ExaBayes_mtPCG_run1_burn10_mean_MCC.Renamed.tre
92.95 KB
-
ExaBayes_mtPCG_run2_burn10_mean_MCC.Renamed.tre
92.84 KB
-
ExaBayes_mtPCG_run3_burn10_mean_MCC.Renamed.tre
92.95 KB
-
ExaBayes_UCE_NoNoct_run0_unpart_burn10_mean_MCC.Renamed.tre
30.72 KB
-
ExaBayes_UCE_NoNoct_run1_unpart_burn10_mean_MCC.Renamed.tre
30.74 KB
-
ExaBayes_UCE_NoNoct_run2_unpart_burn10_mean_MCC.Renamed.tre
30.73 KB
-
ExaBayes_UCE_NoNoct_run3_unpart_burn10_mean_MCC.Renamed.tre
30.74 KB
-
ExaBayes_UCE_run0_unpart_burn10_mean_MCC.Renamed.tre
48.55 KB
-
ExaBayes_UCE_run1_unpart_burn10_mean_MCC.Renamed.tre
48.57 KB
-
ExaBayes_UCE_run2_unpart_burn10_mean_MCC.Renamed.tre
48.55 KB
-
ExaBayes_UCE_run3_unpart_burn10_mean_MCC.Renamed.tre
48.55 KB
-
IQTREE_16S.Renamed.tre
14.60 KB
-
IQTREE_mtPCG_NoNoct.Renamed.tre
6.18 KB
-
IQTREE_mtPCG.Renamed.tre
8.27 KB
-
IQTREE_UCE_60p_NoNoct.Renamed.tre
2.03 KB
-
IQTREE_UCE60p_mito_16S.Renamed.tre
11.22 KB
-
IQTREE_UCE60p.Renamed.tre
3.43 KB
-
MCMCtree_mtPCG_AC1.Renamed.tre
9.96 KB
-
MCMCtree_mtPCG_AC2.Renamed.tre
9.95 KB
-
MCMCtree_mtPCG_AC3.Renamed.tre
9.82 KB
-
MCMCtree_mtPCG_ILN1.Renamed.tre
9.97 KB
-
MCMCtree_mtPCG_ILN2.Renamed.tre
9.97 KB
-
MCMCtree_mtPCG_ILN3.Renamed.tre
9.97 KB
-
MCMCtree_mtPCG_NoNoct_auto1.Renamed.tre
7.88 KB
-
MCMCtree_mtPCG_NoNoct_auto2.Renamed.tre
7.88 KB
-
MCMCtree_mtPCG_NoNoct_auto3.Renamed.tre
7.88 KB
-
MCMCtree_mtPCG_NoNoct_uncor1.Renamed.tre
7.90 KB
-
MCMCtree_mtPCG_NoNoct_uncor2.Renamed.tre
7.89 KB
-
MCMCtree_mtPCG_NoNoct_uncor3.Renamed.tre
7.90 KB
-
MCMCtree_UCE_NoNoct_auto1.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_auto2.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_auto3.Renamed.tre
2.65 KB
-
MCMCtree_UCE_NoNoct_uncor1.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_uncor2.Renamed.tre
2.66 KB
-
MCMCtree_UCE_NoNoct_uncor3.Renamed.tre
2.66 KB
-
mtPCG_alignment.Renamed.Renamed.nex
1.27 MB
-
mtPCG_partition.Renamed.nex
459 B
-
PhyloMAd_Noct.csv
1.42 KB
-
PhyloMAd_NoNoct.csv
1.65 KB
-
README.md
2.31 KB
-
ROA_UCE_DB_CONTRIB_1_ids.txt
2.28 KB
-
ROA_UCE_DB_CONTRIB_1.fasta.gz
113.69 MB
-
roaches-v1-master-probe-list-DUPE-SCREENED.fasta.gz
14.42 MB
-
UCE60p_alignment.Renamed.Renamed.nex
23.38 MB
-
UCE60p_partition.Renamed.nex
71.20 KB
Abstract
Rates of nucleotide substitution vary substantially across the Tree of Life, with potentially confounding effects on phylogenetic and evolutionary analyses. A large acceleration in mitochondrial substitution rate occurs in the cockroach family Nocticolidae, which predominantly inhabit subterranean environments. To evaluate the impacts of this among-lineage rate heterogeneity on estimates of phylogenetic relationships and evolutionary timescales, we analysed nuclear ultraconserved elements (UCEs) and mitochondrial genomes from nocticolids and other cockroaches. Substitution rates were substantially elevated in nocticolid lineages compared with other cockroaches, especially in mitochondrial protein-coding genes. This disparity in evolutionary rates is likely to have led to different evolutionary relationships being supported by mitochondrial genomes and UCE loci. Furthermore, analyses using relaxed-clock models inferred much deeper divergence times compared with a flexible local clock. Our phylogenetic analysis of UCEs, which is the first genome-scale study to include all nine major cockroach families, unites Corydiidae and Nocticolidae and places Anaplectidae as the sister lineage to the rest of Blattoidea. We uncover an extraordinary level of genetic divergence in Nocticolidae, including two highly distinct clades that separated ~115 million years ago despite both containing representatives of the genus Nocticola. The results of our study highlight the potential impacts of high among-lineage rate variation on estimates of phylogenetic relationships and evolutionary timescales.
The data provided consists of the output tree files (.tre) from each of the phylogenetic analyses and the associated sequence alignments and partition files (.nex) for each of the datasets.
Description of the Data and file structure
The datasets are labelled as:
- “16S” refers to the dataset containing a 440 bp fragment of the mitochondrial gene encoding 16S rRNA (16S). Note that some internal regions have been trimmed from the original alignment.
- "mtPCG" refers to the dataset containing a concatenated alignment of 13 mitochondrial protein-coding genes. Note that all codon positions are present in the alignment, but only second codon positions were used in analyses
- "UCE60p" refers to the dataset containing a concatenated alignment of the UCE loci included in our 60 percent completeness matrix
- Files containing *NoNoct* are the trees produced after removing representatives of Nocticolidae from the alignment. All other files are trees estimated using all taxa.
- PhyloMAd* files are the output files from our test of substitution model adequacy in PhyloMAd
- roaches-v1-master-probe-list-DUPE-SCREENED.fasta.gz: Final deduplicated probe set (minimum identity threshold of 80% and a minimum coverage of 83%) cleaned from sequences with ambiguous base calls and GC-content above 70% or below 30%,
designed for all six considered genomes (319,243 baits targeting 30,059 loci). This deduplicated set was obtained with ‘phyluce_probe_get_tiled_probes’, ‘phyluce_probe_easy_lastz’ and ‘phyluce_probe_remove_duplicate_hits_from_probes_using_lastz’.
- ROA_UCE_DB_CONTRIB_1.fasta.gz: Extracted UCEs from all samples using the bait set provided in this study. This is contribution #1 for the Cockroach UCE Database. Each sample was assigned a unique identification code (ROA-X-UCEDB).
The database is maintained at: https://github.com/sihellem/ROA-UCE-DB/.
- ROA_UCE_DB_CONTRIB_1_ids.txt: unique identification codes for UCE samples (ROA-X-UCEDB)
For more information or if you just want to talk noctis please do not hesitate to contact me (Toby) at toby.kovacs@sydney.edu.au :)
To view the tree files we recommend using FigTree: https://github.com/rambaut/figtree/releases
I would open the 16S or mtPCG alignments in MEGA https://www.megasoftware.net/ or Seqotron https://github.com/4ment/seqotron/
I would avoid opening the alignment of UCEs, just let them be.
Missing data code in PhyloMAd output files: NA