Data from: Occurrence, morphology, and taxonomy of the late Cambrian Laurentian dikelocephalid trilobite Osceolia Walcott, 1914
Data files
Apr 29, 2025 version files 150.92 KB
- classifier41collec.csv (2.13 KB)
- classifier41litho.csv (2.87 KB)
- classifier41Species.csv (2.40 KB)
- classifier62collec.csv (3.13 KB)
- classifier62ledge.csv (3.26 KB)
- classifier62litho.csv (3.85 KB)
- classifier62Species.csv (3.27 KB)
- Osc41PY.tps (20.28 KB)
- Osc62CrAMPOSymmNew.tps (64.40 KB)
- OscWINmCRNew.tps (20.81 KB)
- OscWINmPY.tps (6.94 KB)
- README.md (17.58 KB)
Abstract
This dataset provides details of the occurrence of Osceolia Walcott, 1914, as discussed in the accompanying paper. The data include the field notes from previous collections and a list of specimens together with the repositories in which they are stored. The table included here provides the detailed notes made by the authors while studying the various specimen and repository collections. We include bivariate plots of the variance explained by each relative warp for the sclerites analyzed, and a morphometric measurement error analysis. The raw data files used to run the geometric morphometric analyses are also included in the supplementary material.
Dataset DOI: 10.5061/dryad.xwdbrv1rd
Description of the data and file structure
This dataset corresponds to a related article about a late Cambrian trilobite, Osceolia, from the St. Lawrence Formation and the overlying Jordan Formation in the Upper Mississippi Valley. The material provides further support for the analytical results and discussions presented in the main study, including specimen repository information, further information on the analysis of morphological variation in the specimen dataset, Osceolia locality information deemed not essential to include in the main text, field notes of the earlier workers who collected data relevant to Osceolia occurrences, and information about the geographic and stratigraphic occurrence of the nomen nudum "Osceolina diabolo". The sclerites of Osceolia analyzed here were collected on field excursions in Wisconsin and Minnesota by both amateur and professional paleontologists early in the last century. All Osceolia specimens were imaged in dorsal view with the palpebral lobe surface mounted horizontally and the pygidial axial furrow mounted horizontally, mostly using a Nikon SLR camera, either as 35 mm black and white negatives or directly as digital images. To reveal the morphology more clearly, specimens were coated with ammonium chloride sublimate before being photographed. The image library was assembled intermittently between 1986 and 2024. Images were taken within the specimen’s host institution or at the University of California, Riverside. Negatives were digitally scanned with a Polaroid SprintScan 35 plus or an Epson Perfection V700 scanner. Raw data files for the geometric morphometric analyses are also provided in this dataset.
Files and variables
File: OscSupple_production.docx
Description: This file contains the relevant supplementary data files combined into a single Microsoft Word file. Each file included is described below:
Supplementary Data 1: Osceolia repositories and cataloged specimen list. This lists the specimen numbers of the Osceolia specimens used in the main study along with the repository institutions in which they are stored. An explanation of the repository abbreviations is also provided to accompany the complete specimen numbers.
Supplementary Data 2: Osceolia locality details. These notes include information on localities collected and logged by the authors, as well as information gleaned from the various sources associated with the specimen repository collections. Also provided are the justifications for rejecting Osceolia occurrences mentioned in historical collections and an explanation of one specimen that was assigned to cf. Osceolia.
Supplementary Data 3: Measurement error analysis. The results of a simple test of variance of shape among selected specimens to quantify human measurement error when collecting geometric morphometric landmark data.
Supplementary Data 4: Osceola, Wisconsin and Minnesota combined Norwalk Member collections. Results of the morphometric analysis of a combined dataset of specimens from the Osceola locality, from both the Wisconsin and Minnesota sides and restricted to the Norwalk Member of the Jordan Formation, are provided to support the results included in the main study.
Supplementary Data 5: Shape variation captured by relative warps in various datasets. An explanation is provided of the choice of relative warp components used in the analysis discussed in the main study.
Supplementary Data 6: Earlier worker field notes for the St. Lawrence and Jordan formations at Osceola, Wisconsin. Images of archived material at the Smithsonian Institution Archives are provided, illustrating the field notes made by earlier workers including E.O. Ulrich, Charles D. Walcott and Charles Schuchert.
Supplementary Data 7: Notes on the geographic and stratigraphic occurrence of the nomen nudum "Osceolina diabolo". Information from the Smithsonian Paleontology Collection Location file index cards and the Smithsonian Institution Archives is provided as a discussion of "Osceolina diabolo" occurrences in the Upper Mississippi Valley.
File: OscSupple2b_log_4_24_25.xlsx
Description: This table lists all the information that the authors collected from various sources, such as the Smithsonian Institution Archives, while compiling the Osceolia specimen image library.
Variables
- Sheet Osceolia log contains type specimen information for all published studies that include Osceolia specimens.
  - Inst/Specimen No: Repository abbreviation (see above) along with the reposited specimen number.
  - Original Genus name: Genus group to which the specimen was assigned when first published
  - Original species name: Species group to which the specimen was assigned when first published
  - Subsequent illustration: Record of subsequent illustration of the same specimen, if any
  - Current genus: Genus group to which the specimen is currently assigned, either by Srivastava and Hughes' observations or by subsequent studies before Srivastava and Hughes
  - Current species: Species group to which the specimen is currently assigned, either by Srivastava and Hughes' observations or by subsequent studies before Srivastava and Hughes
  - Sclerite: Designation of the specimen as a cranidium, free cheek, thoracic segment, hypostome or pygidium
  - Part/ Counterpart: Designation of the specimen as either the part or the counterpart of the actual sclerite
  - Locality: Information about the locality where the specimen was collected
  - Co-occurrent taxa: Record of co-occurring taxa, if any
  - Geological Unit and Facies: Stratigraphic information associated with the specimen
  - Original Author/Year, Pages, Plate and Figure: Citation details of the first published illustration of the specimen
  - Film/Frame: Name of the image file according to the 35 mm black and white negative films collected in the 1980s
  - Cast: Record of whether a latex cast was made if the specimen was a counterpart
  - Scanned image: Record of whether an image of the specimen exists in the digital image library
  - Jin-Bo Photo: Record of whether an extra copy of the image is stored in a colleague's (Jin-Bo Hou) digital image library
  - Image quality: Record of whether the resolution of the digital image is unsatisfactory
  - Notes: Any additional information associated with the specimen.
- Sheet USNM Locality Info contains relevant information collected about associated type localities with data reposited at the Smithsonian National Museum of Natural History
  - USNM Locality: Locality code as assigned and recorded at the museum repository
  - Age: Geological age of the locality
  - Geologic Unit: Stratigraphic unit bearing fossils at the locality
  - Location: Detailed description of the locality
  - Fossils found: List of fossils collected at the locality
- Sheet Non-fig-Fig specimen localities contains detailed information about the non-type localities
  - Collection: Museum abbreviation referring either to where the specimen is reposited (also see Supplementary Data 1) or to where the detailed information about the locality is reposited.
  - Locality: Locality name and description
File: OscWINmCRNew.tps
A tps file listing landmark data used for morphometric analysis of 20 cranidia from Osceola, WI, Norwalk Member collection. The data listed is in x-y coordinate format.
File: OscWINmPY.tps
A tps file listing landmark data used for morphometric analysis of 14 pygidia from Osceola, WI, Norwalk Member collection. The data listed is in x-y coordinate format.
File: Osc62CrAMPOSymmNew.tps
A tps file listing landmark data used for morphometric analysis of 62 cranidia from all collections of O. osceola pooled together to analyze the shape variation of the species as a whole. The data listed is in x-y coordinate format.
File: Osc41PY.tps
A tps file listing landmark data used for morphometric analysis of 41 pygidia from all collections of O. osceola pooled together to analyze the shape variation of the species as a whole. The data listed is in x-y coordinate format.
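The '.tps' files above store each specimen as a block of x-y landmark coordinates. As a rough illustration of how such a file could be read outside of tpsDig or R, here is a minimal Python sketch. It assumes the common TPS layout in which each specimen starts with an `LM=<n>` line followed by `n` coordinate rows and then optional `KEY=value` lines such as `IMAGE=`, `ID=` or `SCALE=`; the exact keywords present in these particular files are not confirmed by this README. The function name `parse_tps` and the sample values are hypothetical and are not taken from the dataset.

```python
from typing import Dict, List


def parse_tps(text: str) -> List[Dict[str, object]]:
    """Parse landmark-only TPS text into one dict per specimen.

    Assumes every specimen begins with 'LM=<n>' followed by n rows of
    'x y' coordinates, then optional 'KEY=value' lines (assumed layout).
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    specimens: List[Dict[str, object]] = []
    i = 0
    while i < len(lines):
        key, sep, value = lines[i].partition("=")
        if not sep:
            raise ValueError(f"unexpected line outside a landmark block: {lines[i]!r}")
        key = key.strip().upper()
        if key == "LM":
            n = int(value)
            rows = lines[i + 1 : i + 1 + n]
            if len(rows) != n:
                raise ValueError("file ends before all landmarks were read")
            coords = []
            for row in rows:
                x, y = row.split()[:2]
                coords.append((float(x), float(y)))
            specimens.append({"landmarks": coords})
            i += 1 + n
        else:
            if not specimens:
                raise ValueError("metadata line found before the first LM= line")
            # Attach metadata such as IMAGE, ID or SCALE to the current specimen.
            specimens[-1][key] = value.strip()
            i += 1
    return specimens


# Invented two-specimen example with three landmarks each (not real data).
SAMPLE = """\
LM=3
1.0 2.0
3.5 4.5
5.0 1.0
IMAGE=example_a.jpg
ID=0
LM=3
1.1 2.1
3.4 4.6
5.2 0.9
IMAGE=example_b.jpg
ID=1
"""

records = parse_tps(SAMPLE)
print(len(records), "specimens;", len(records[0]["landmarks"]), "landmarks each")
```

To try it on one of the files listed above, the file contents could be passed in with, for example, `parse_tps(open("Osc41PY.tps").read())`, after which the number of records can be compared with the specimen counts stated in this README.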
File: classifier62collec.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the locality designation of each of the 62 cranidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc62CrAMPOSymmNew.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Locality: Specimen locality named according to locality names listed in 'OscSupple2b_log_4_24_25.xlsx'
- PoinColor: Label color designated to each specimen according to their locality for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their locality for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
File: classifier62ledge.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the ledge class designation of each of the 62 cranidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc62CrAMPOSymmNew.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Ledge: Cranidial anterior border ledge class designated to each specimen as discussed in the main text.
- PoinColor: Label color designated to each specimen according to their ledge class for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their ledge class for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
File: classifier62litho.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the lithology designation of each of the 62 cranidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc62CrAMPOSymmNew.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Locality: Specimen locality named according to locality names listed in 'OscSupple2b_log_4_24_25.xlsx'
- Lithology: Lithology of each specimen.
- PoinColor: Label color designated to each specimen according to their lithology for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their lithology for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
File: classifier62Species.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the type species or non-type species designation according to previous illustrations and systematic descriptions of Osceolia specimens included in the 62 cranidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc62CrAMPOSymmNew.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Locality: Specimen locality named according to locality names listed in 'OscSupple2b_log_4_24_25.xlsx'
- Author: Original author that published each specimen.
- Species: Original type species name of each previously published specimen.
- PoinColor: Label color designated to each specimen according to their previous species designation for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their previous species designation for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
File: classifier41collec.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the locality designation of each of the 41 pygidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc41PY.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Locality: Specimen locality named according to locality names listed in 'OscSupple2b_log_4_24_25.xlsx'
- PoinColor: Label color designated to each specimen according to their locality for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their locality for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
File: classifier41litho.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the lithology designation of each of the 41 pygidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc41PY.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Locality: Specimen locality named according to locality names listed in 'OscSupple2b_log_4_24_25.xlsx'
- Level: Level defined within the locality in previous column from where each specimen was collected.
- Lithology: Lithology of each specimen.
- PoinColor: Label color designated to each specimen according to their lithology for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their lithology for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
File: classifier41Species.csv
A '.csv' file (required format to read classifier files in ggplot2 package in R) listing the type species or non-type species designation according to the previous illustrations and systematic descriptions of Osceolia specimens included in the 41 pygidia analyzed in the main study.
Variables
- ID: Specimen ID of specimens used in the morphometrics analysis, listed in the same sequence as in the 'Osc41PY.tps' file.
- Image: Image file name in image library of all Osceolia specimens.
- Locality: Specimen locality named according to locality names listed in 'OscSupple2b_log_4_24_25.xlsx'
- Author: Original author that published each specimen.
- Species: Original type species name of each previously published specimen.
- PoinColor: Label color designated to each specimen according to their previous species designation for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinShape: Label shape designated to each specimen according to their previous species designation for relevant PCA plots as shown in the main text. Codes/ names used follow conventions for using ggplot2 package in R.
- PoinStroke: Color of label outline for each specimen in the relevant PCA plots as discussed in the main text.
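Because each classifier file lists specimens in the same sequence as its '.tps' file, a quick consistency check before plotting can catch a misaligned or truncated file. The following Python sketch is one possible way to do this with the standard library; it is not part of the authors' R workflow. The column names come from the variable lists above, while the function name `load_classifier` and all sample values (localities, colors, shapes) are invented for illustration.

```python
import csv
import io
from collections import Counter

# Columns shared by the locality classifier files, per the variable lists above.
REQUIRED = ("ID", "Image", "Locality", "PoinColor", "PoinShape")


def load_classifier(handle, n_specimens, required=REQUIRED):
    """Read a classifier CSV and check its columns and row count.

    n_specimens is the number of specimens in the matching '.tps' file
    (for example 62 cranidia or 41 pygidia).
    """
    reader = csv.DictReader(handle)
    missing = [col for col in required if col not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    rows = list(reader)
    if len(rows) != n_specimens:
        raise ValueError(f"expected {n_specimens} rows, found {len(rows)}")
    return rows


# Invented three-row example (not real data from the dataset).
SAMPLE_CSV = """\
ID,Image,Locality,PoinColor,PoinShape
0,example_a.jpg,Locality A,red,16
1,example_b.jpg,Locality A,red,16
2,example_c.jpg,Locality B,blue,17
"""

rows = load_classifier(io.StringIO(SAMPLE_CSV), n_specimens=3)
per_locality = Counter(row["Locality"] for row in rows)
print(dict(per_locality))
```

For a real file, the handle would come from something like `open("classifier62collec.csv", newline="")` with `n_specimens=62`. Files with additional columns (for example Ledge, Lithology, Author, Species or PoinStroke) could be checked by passing a different `required` tuple.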
Code/software
Description of methods used for collection/generation of data:
Measurement error analysis was conducted using traditional landmark data obtained from tpsDig (http://www.sbmorphometrics.org/soft-dataacq.html, '.tps' files) for three selected cranidia, each with 28 landmarks and measured ten times. Shape variance was calculated using version 8 of the Integrated Morphometrics Package (Sheets, 2014; https://www.animal-behaviour.de/imp/).
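To make the idea of shape variance among repeated measurements more concrete, the sketch below shows one generic way such a quantity can be computed: configurations are centered, scaled to unit centroid size and rotated onto a common reference (a simple partial Procrustes superimposition in 2D), and the variance is the summed squared deviation from the mean shape divided by n - 1. This is an illustrative stand-in only. It is not the algorithm implemented in the Integrated Morphometrics Package, and the five-landmark configurations are invented rather than taken from the 28-landmark cranidia described above.

```python
import cmath
import math


def to_shape(config):
    """Center a 2D configuration and scale it to unit centroid size."""
    z = [complex(x, y) for x, y in config]
    centroid = sum(z) / len(z)
    z = [p - centroid for p in z]
    size = math.sqrt(sum(abs(p) ** 2 for p in z))
    return [p / size for p in z]


def rotate_onto(z, ref):
    """Rotate centered configuration z to best fit ref (least squares)."""
    s = sum(r * p.conjugate() for p, r in zip(z, ref))
    return [p * (s / abs(s)) for p in z]


def superimpose(configs, n_iter=10):
    """Iteratively align all configurations to their evolving mean shape."""
    shapes = [to_shape(c) for c in configs]
    ref = shapes[0]
    for _ in range(n_iter):
        shapes = [rotate_onto(z, ref) for z in shapes]
        mean = [sum(col) / len(shapes) for col in zip(*shapes)]
        norm = math.sqrt(sum(abs(p) ** 2 for p in mean))
        ref = [p / norm for p in mean]
    return shapes


def shape_variance(shapes):
    """Summed squared distance to the mean shape, divided by n - 1."""
    mean = [sum(col) / len(shapes) for col in zip(*shapes)]
    ss = sum(abs(p - m) ** 2 for z in shapes for p, m in zip(z, mean))
    return ss / (len(shapes) - 1)


def transform(config, angle, scale, shift):
    """Rotate, scale and translate a configuration (shape is unchanged)."""
    rot = cmath.rect(scale, angle)
    moved = [complex(x, y) * rot + complex(*shift) for x, y in config]
    return [(p.real, p.imag) for p in moved]


# Invented five-landmark configuration (not real data).
BASE = [(0, 0), (4, 0), (4, 3), (0, 3), (2, 5)]

# Replicates that differ only in position, size and orientation.
exact = [BASE, transform(BASE, 0.3, 2.0, (5, -1)), transform(BASE, 1.2, 0.5, (0, 7))]
# A replicate with one landmark displaced, mimicking digitizing error.
noisy = [BASE, [(0.2, 0), (4, 0), (4, 3), (0, 3), (2, 5)]]

var_exact = shape_variance(superimpose(exact))
var_noisy = shape_variance(superimpose(noisy))
print(var_exact < var_noisy)
```

In a measurement error test of this kind, the variance among repeated digitizations of one specimen would typically be compared with the variance among different specimens; replicates that differ only by position, size and orientation contribute essentially no shape variance, whereas a displaced landmark does.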
Cartesian X and Y coordinates of landmark data for growth variation explained by relative warps were obtained from tpsDig for the 20 cranidia and 14 pygidia from the Osceola, Wisconsin, Norwalk Member collection, and for the 62 cranidia and 41 pygidia from the entire collection ('.tps' files).
Classifier files used by the ggplot2 package in R require the file format '.csv'. Microsoft Excel was used to generate files of this type.
Methods for processing the data:
Morphometric analysis of the landmark dataset was conducted in R using the RStudio interface (https://www.rstudio.com). The R packages used for statistical analysis were geomorph (Baken et al., 2021; Adams et al., 2021) for most of the morphometric functions, the Residual Randomization in Permutation Procedures package (RRPP; Collyer and Adams, 2018, 2021) for fitting linear regression models, and ggplot2 (Wickham, 2016) for plotting specimens classified according to the relevant parameters.
