Data from: Affinity in Concentric Circles (ACC): A geometric representation of dendrogram and interpretation
Data files
Jun 01, 2026 version files 16.25 MB
-
Appendix_S10.xlsx
187.72 KB
-
Appendix_S10.zip
764.02 KB
-
Appendix_S11.xlsx
171.67 KB
-
Appendix_S11.zip
705.57 KB
-
Appendix_S12.xlsx
170.63 KB
-
Appendix_S12.zip
724.48 KB
-
Appendix_S13.xlsx
171.49 KB
-
Appendix_S13.zip
698.29 KB
-
Appendix_S14.xlsx
176.68 KB
-
Appendix_S14.zip
674.13 KB
-
Appendix_S15.xlsx
174.45 KB
-
Appendix_S15.zip
704.20 KB
-
Appendix_S16.xlsx
213.25 KB
-
Appendix_S16.zip
817.28 KB
-
Appendix_S17.xlsx
98.39 KB
-
Appendix_S17.zip
8.48 KB
-
Appendix_S1a.pdf
60.16 KB
-
Appendix_S1b.pdf
60.08 KB
-
Appendix_S1c.pdf
58.84 KB
-
Appendix_S1d.pdf
60.75 KB
-
Appendix_S1e.pdf
69.70 KB
-
Appendix_S2a.pdf
59.88 KB
-
Appendix_S2b.pdf
58.60 KB
-
Appendix_S2c.pdf
60.32 KB
-
Appendix_S2d.pdf
68.27 KB
-
Appendix_S3a.pdf
50.54 KB
-
Appendix_S3b.pdf
50.56 KB
-
Appendix_S3c.pdf
50.59 KB
-
Appendix_S3d.pdf
50.56 KB
-
Appendix_S3e.pdf
50.59 KB
-
Appendix_S4.xlsx
54.63 KB
-
Appendix_S4.zip
389.86 KB
-
Appendix_S5.csv
289 B
-
Appendix_S5.xlsx
11.37 KB
-
Appendix_S6.xlsx
34.35 KB
-
Appendix_S6.zip
95.92 KB
-
Appendix_S7.xlsx
187.12 KB
-
Appendix_S7.zip
766.77 KB
-
Appendix_S8.xlsx
188.02 KB
-
Appendix_S8.zip
774.22 KB
-
Appendix_S9.xlsx
189.70 KB
-
Appendix_S9.zip
771.88 KB
-
Figure_S1.jpg
406.65 KB
-
Figure_S2.jpg
1.87 MB
-
Figure_S3.jpg
1.62 MB
-
README.md
14.34 KB
-
Supporting_information.pdf
908.57 KB
-
Table_S1.pdf
124.59 KB
-
Table_S2.pdf
115.82 KB
-
Table_S3.pdf
187.28 KB
-
Table_S4.pdf
108.90 KB
-
Table_S5.pdf
158.85 KB
Abstract
We introduce “Affinity in Concentric Circles” (ACC), a method that converts dendrogram to a geometric form using similarity scores to compare dendrograms and identify affinity change pattern. Study location: China for trilobites and global for nisusiid brachiopods. Study taxon: Trilobites (Ordovician of China) and brachiopods (Cambrian of worldwide). ACC compares dendrograms derived from subset datasets against a dendrogram from an inclusive dataset that aggregates all subset datasets. It employs two similarity scores, local and global scores for each cluster in the subset dendrogram. The local score is the original similarity value of the cluster calculated from subset datasets, whereas the global score for the same cluster is recalculated from the similarity values of the inclusive dataset. The local score is converted to an angle along a circle, and the global score to a diameter of the circle, respectively. This geometric conversion places two areas of a cluster at an angle along a circle. The procedure begins with a cluster with the highest local score and then sequentially adds area or merges clusters by centering and aligning the circles. As a result, areas in each subset dendrogram are placed along concentric circles at angles. Affinities in the ACC diagram are quantified using two metrics: pairwise distance (PD) between two members in a subset diagram and comprehensive travel distance (CTD) of a member across multiple subset dendrograms. Affinities among members are compared both within and across diagrams based on geometric distribution, and relative changes in the affinities are tracked by comparing average PD and CTD values. The ACC analyses of six trilobite-occurring areas in China for four Ordovician epochs and three Ordovician evolutionary faunas reveal that Jiangnan slope, which is closest to Tianshan basin, was likely a center of diversification, Sino-Korea platform was biogeographically isolated, Qaidam was closer to Sino-Korea, and deep-water areas were more strongly connected by active oceanic currents in depth. The average CTD values suggest that significant change in overall biogeographic stability occurred in the Arenig and Llanvirn. The analysis of seven nisusiid brachiopod-occurring areas for four middle Cambrian ages reveals that Laurentia, the likely center of origin, became progressively isolated from other areas, Australia with the lowest average CTD underwent the most intense biogeographic changes, and overall biogeographic stability decreased toward the Drumian. These findings are largely inconsistent with the area distributions in the paleomaps. The distribution sector of ACC diagrams reflects β-diversity for each period and fauna. ACC represents and addresses Simpson’s paradox in a unique manner and tends to reduce its manifestation. ACC utilizes all clusters and their similarity scores of dendrograms from both subset and inclusive datasets for analysis, allowing us to compare dendrograms and identify patterns that are not apparent in dendrograms.
Dataset DOI: 10.5061/dryad.s4mw6m9k3
Description of the data and file structure
Appendices S1 to S3 are datasets used for analysis in PAST 4.03. Appendices S4 to S17 are the procedures in Excel formulas. Figures S1 to S3 are supplementary figures for the article. Tables S1 to S5 are supplementary tables for the article.
Files and variables
File: Appendix_S1a.pdf
Description: Dataset of presence/absence of Ordovician trilobite genera in the Tremadoc.
File: Appendix_S1b.pdf
Description: Dataset of presence/absence of Chinese trilobite genera in the Arenig.
File: Appendix_S1c.pdf
Description: Dataset of presence/absence of Chinese trilobite genera in the Llanvirn.
File: Appendix_S1d.pdf
Description: Dataset of presence/absence of Chinese trilobite genera in the Caradoc.
File: Appendix_S1e.pdf
Description: Dataset of presence/absence of Chinese trilobite genera in the Ordovician.
File: Appendix_S2a.pdf
Description: Dataset of presence/absence of Ibex-I fauna.
File: Appendix_S2b.pdf
Description: Dataset of presence/absence of Ibex-II fauna.
File: Appendix_S2c.pdf
Description: Dataset of presence/absence of Whiterock fauna.
File: Appendix_S2d.pdf
Description: Dataset of presence/absence of aggregated fauna.
File: Appendix_S3a.pdf
Description: Dataset of presence/absence of nisusiid brachiopod species in Stage 3.
File: Appendix_S3b.pdf
Description: Dataset of presence/absence of nisusiid brachiopod species in Stage 4.
File: Appendix_S3c.pdf
Description: Dataset of presence/absence of nisusiid brachiopod species in Wuliuan.
File: Appendix_S3d.pdf
Description: Dataset of presence/absence of nisusiid brachiopod species in Drumian.
File: Appendix_S3e.pdf
Description: Dataset of presence/absence of nisusiid brachiopod species in middle Cambrian.
File: Appendix_S4.xlsx
Description: ACC procedure in Figures 2 and 3 explained using Excel formulas. (a) How to add area to cluster using ((A + B) + C) (Figure 2). (b) How to merge two clusters with ((A + B) + (C + D)) (Figure 3). (c) Change in initial and final pairwise distance (PD) of BC and AC with varying local scores of BC and fixed score of AC for the example in (a) and XY plots with linear trend. (d) Change in initial and final pairwise distance (PD) of BD and BC with varying local scores of BD and fixed score of BC for the example in (b) and XY plots with linear trend. (e) Correlation of local scores and pairwise distances for the examples in (a) and (b) and XY plots. Formulas: θ = 180 X (1-local score), d = 1 + 10 X (1-global score), PD = 2 × π × r × (θ / 360) where r = d / 2, PD = 2 × π × r1 × (θ / 360) + |r1 – r2| where r1 is the smaller radius. Unit of θ is degree and d is unitless.
File: Appendix_S4.zip
Description: Zip file of csv files and figures of charts in Appendix_S4.
File: Appendix_S5.xlsx
Description: Data and formulas in excel format for PD and TD calculation illustrated in Figure 4. TD = 2 × π × r × (θ / 360) + |r1 – r2| where r1 is the smaller radius, and see Appendix 4 for formulas for θ, d, and PD.
File: Appendix_S5.csv
Description: Appendix_S5 in csv format.
File: Appendix_S6.xlsx
Description: α-, β-, and γ-diversity data. (a) Ordovician trilobite genera in six areas in China and four epochs and histogram of calculated β-diversity. (b) Three Ordovician evolutionary fauna in six areas in China histogram of calculated β-diversity. (c) Middle Cambrian brachiopod nisusiid species in seven areas and four ages histogram of calculated β-diversity. β-diversity = γ-diversity / average of α-diversity.
File: Appendix_S6.zip
Description: Zip file of csv files and figures of charts in Appendix_S6.
File: Appendix_S7.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S1 using Simpson index. (a) Tremadoc. (b) Arenig. (c) Llanvirn. (d) Caradoc. (e) β-diversity estimated by the area of the sector of ACC circles and reciprocal of average of local scores and histograms of estimated β-diversity and reciprocal of local score averages. (f) Comprehensive travel distance (CTD) of each area using all other areas as fixed areas and average PD and histograms of average CTD1, CTD2 and PD. (g–l) Travel distance (TD) using different areas as a fixed area and histograms of TD and total TD: (g) Tianshan basin, (h) Sino-Korea platform, (i) Ordos basin, (j) Qaidam, (k) Yangtze platform, and (l) Jiangnan slope. (m) Local scores and pairwise distances of area pairs and XY plots for four epochs. Abbreviation of epochs: T = Tremadoc, A = Arenig, L = Llanvirn, C = Caradoc. Abbreviations of areas: S = Sino-Korea platform, O = Ordos basin, Q = Qaidam, T = Tianshan basin, Y = Yangtze platform, J = Jiangnan slope.
File: Appendix_S7.zip
Description: Zip file of csv files and figures of charts in Appendix_S7.
File: Appendix_S8.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S1 using Jaccard index. See Appendix S7 for captions for (a) to (m) and abbreviations of epochs and areas.
File: Appendix_S8.zip
Description: Zip file of csv files and figures of charts in Appendix_S8.
File: Appendix_S9.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S1 using Ochiai index. See Appendix S7 for captions for (a) to (m) and abbreviations of epochs and areas.
File: Appendix_S9.zip
Description: Zip file of csv files and figures of charts in Appendix_S9.
File: Appendix_S10.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S1 using Raup-Crick index. See Appendix S7 for captions for (a) to (m) and abbreviations of epochs and areas.
File: Appendix_S10.zip
Description: Zip file of csv files and figures of charts in Appendix_S10.
File: Appendix_S11.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S2 using Simpson index (with JO as the initial sequence). (a) Ibex-I fauna. (b) Ibex-II fauna. (c) Whiterock fauna. (d) β-diversity estimated by the area of the sector of ACC circles and reciprocal of average of local scores and histograms of estimated β-diversity and reciprocal of local score averages. (e) Comprehensive travel distance (CTD) of each area using all other areas as fixed areas and average PD and histograms of average CTD1, CTD2 and PD. (f–k) Travel distance using different areas as a fixed area and histograms of TD and total TD: (f) Tianshan basin, (g) Sino-Korea platform, (h) Ordos basin, (i) Qaidam, (j) Yangtze platform, and (k) Jiangnan slope. (l) Local scores and pairwise distances of area pairs and XY plots for three evolutionary faunas. Abbreviation of evolutionary faunas: I-I = Ibex-I fauna, I-II = Ibex-II fauna, W = Whiterock fauna. See Appendix S7 for abbreviations of areas.
File: Appendix_S11.zip
Description: Zip file of csv files and figures of charts in Appendix_S11.
File: Appendix_S12.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S2 using Simpson index (with QO as the initial sequence). See Appendix S11 for captions for (a) to (l) and abbreviations of evolutionary faunas and areas.
File: Appendix_S12.zip
Description: Zip file of csv files and figures of charts in Appendix_S12.
File: Appendix_S13.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S2 using Jaccard index. See Appendix S11 for captions for (a) to (l) and abbreviations of evolutionary faunas and areas.
File: Appendix_S13.zip
Description: Zip file of csv files and figures of charts in Appendix_S13.
File: Appendix_S14.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S2 using Ochiai index. See Appendix S11 for captions for (a) to (l) and abbreviations of evolutionary faunas and areas.
File: Appendix_S14.zip
Description: Zip file of csv files and figures of charts in Appendix_S14.
File: Appendix_S15.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S2 using Raup-Crick index. See Appendix S11 for captions for (a) to (l) and abbreviations of evolutionary faunas and areas.
File: Appendix_S15.zip
Description: Zip file of csv files and figures of charts in Appendix_S15.
File: Appendix_S16.xlsx
Description: Data and formulas in excel format for analysis of data in Appendix S3 using Raup-Crick index. (a) Stage 3. (b) Stage 4. (c) Wuliuan. (d) Drumian. (e) β-diversity estimated by the area of the sector of ACC circles and reciprocal of average of local scores and histograms of estimated β-diversity and reciprocal of local score averages. (f) Comprehensive travel distance (CTD) of each area using all other areas as fixed areas and average PD and histograms of average CTD1, CTD2 and PD. (g–m) Travel distance using different areas as a fixed area and histograms of TD and total TD: (g) Siberia, (h) South China, (i) Australia, (j) High latitude peri-Gondwana, (k) Low latitude peri-Gondwana, (l) Sino-Korea, and (m) Laurentia. (n) Local scores and pairwise distances of area pairs and XY plots for four ages. Abbreviation of ages: 3 = Stage 3, 4 = Stage 4, W = Wuliuan, D = Drumian. Abbreviations of areas: L = Laurentia, h = high latitude peri-Gondwana, l = low latitude peri-Gondwana, C = South China, B = Siberia, S = Sino-Korea, A = Australia.
File: Appendix_S16.zip
Description: Zip file of csv files and figures of charts in Appendix_S16.
File: Appendix_S17.xlsx
Description: Data for analysis in Figure 13 and 14 in relation to Simpson's paradox. Local and global score data for analysis of the Ordovician trilobite datasets with respect to Simpson’s paradox. (a) Pairwise distance change in Figure 13. (b) Pre-conversion, incomplete aggregation dataset (Ordovician trilobite genera in six areas in China for four epochs) (Figure 14a). (c) Pre-conversion, complete aggregation datasets (Ordovician trilobite genera in six areas in China for three evolutionary fauna) (Figure 14a). (d) Post-conversion incomplete and complete aggregation datasets (Figure 14b, c). (e) Slope gradient and R2. Abbreviations in (a): G = Global score, L = Local score, D = Distance. Abbreviations of time in (b): O = Ordovician and see Appendix S7 for abbreviations of epochs. Abbreviations of faunas in (c): A = Aggregated fauna and see Appendix S11 for abbreviations of faunas. Abbreviations in (d) and (e): A = Number of areas per cluster, T = Total number of clusters, Diff. = Difference between local and global scores, SL = Slope gradient of linear trend, R2 =Square of the Pearson product-moment correlation coefficient.
File: Appendix_S17.zip
Description: Zip file of csv files and figures of charts in Appendix_S17.
File: Figure_S1.jpg
Description: ACC diagrams of six areas in China for Ordovician epochs using Simpson index (with TJ as the initial sequence).
File: Figure_S2.jpg
Description: ACC diagrams of six areas in China for Ordovician epochs using Jaccard, Ochiai, and Raup-Crick index.
File: Figure_S3.jpg
Description: ACC diagrams of six areas in China for three Ordovician trilobite evolutionary faunas using Jaccard, Ochiai, and Raup-Crick index.
File: Table_S1.pdf
Description: Pairwise distance (PD) of six areas in four Ordovician epochs in increasing order (see Appendices S7‒S10). Abbreviations of epochs: Tr = Tremadoc, Ar = Arenig, Ll = Llanvirn, Ca = Caradoc. See Appendix S7 for abbreviations of areas. PD = 2 × π × r1 × (θ / 360) + |r1 – r2|.
File: Table_S2.pdf
Description: Comprehensive travel distance of six areas in three Ordovician intervals. Comprehensive travel distance (CTD) of six areas in three Ordovician intervals; the areas are arranged in descending order of the average (see Appendices S7‒S10 for data); CTD1 and CTD2 indicates the average CTD1 and CTD2, respectively (see Table 1). Abbreviations of intervals: 1 = Tremadoc to Arenig, 2 = Arenig to Llanvirn, 3 = Llanvirn to Caradoc. See Appendix S7 for abbreviations of areas.
File: Table_S3.pdf
Description: Pairwise distance (PD) of six areas for three trilobite evolutionary faunas in increasing order (see Appendices S11‒S15 for data). Abbreviations of faunas: I = Ibex I (I-1 and I-2 are based on the ACC diagrams generated using JO and QO as the first area sequence), II = Ibex II, W = Whiterock. See Appendix S7 for abbreviations of areas.
File: Table_S4.pdf
Description: Comprehensive travel distance (CTD) of six areas in three evolutionary fauna pairs in descending order of the average (see Appendices S11‒S15 for data); CTD1 and CTD2 indicates average CTD1 and CTD2, respectively (see Table 1). Simpson-1 and 2 are based on the ACC diagrams generated using QO and JO as the first area sequence. Abbreviation of fauna: I = Ibex-I fauna, II = Ibex-II fauna, W = Whiterock fauna. See Appendix S7 for abbreviations of areas.
File: Table_S5.pdf
Description: Pairwise distance (PD) of seven areas in four middle Cambrian ages in increasing order, and comprehensive travel distance (CTD) of seven areas in three middle Cambrian intervals in decreasing order of the average (see Appendix S16 for data); CTD1 and CTD2 indicates average CTD1 and CTD2, respectively (see Table 1). Abbreviations of ages: 3 = Stage 3, 4 = Stage 4, W = Wuliuan, D = Drumian. Abbreviations of intervals: First = Stage 3 to 4, Second = Stage 4 to Wuliuan, Third = Wuliuan to Drumian. See Appendix S16 for abbreviations of areas.
File: Supporting_information.pdf
Description: Captions for supplementary material
Code/software
PAST 4.03
