This Sehgal et al_2022_readme.txt file was generated on 2022-01-21 by Ramesh Kumar Sehgal, Abhishek Pratap Singh, Christopher C. Gilbert, Biren A. Patel, Christopher J. Campisano, Keegan R. Selig, Rajeev Patnaik and Ningthoujam Premjit Singh GENERAL INFORMATION 1. Title of Dataset: A new genus of treeshrew and other micromammals from the middle Miocene hominoid locality of Ramnagar, Udhampur District, Jammu & Kashmir, India 2. Author Information Principal Investigator Contact Information Name: Ramesh Kumar Sehgal Institution: Wadia Institute of Himalayan Geology, Dehradun-248001, India Email: rksehgal@wihg.res.in Co-investigator 1 Name: Abhishek Pratap Singh Institution: Wadia Institute of Himalayan Geology, Dehradun-248001, India Email: apsgeo002@gmail.com Co-investigator 2 Name: Christopher C. Gilbert Institution: Department of Anthropology, Hunter College of the City University of New York, 695 Park Avenue, New York, NY 10065, USA Email: cgilbert@hunter.cuny.edu Co-investigator 3 Name: Biren A. Patel Institution: Department of Integrative Anatomical Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA Email: birenpat@usc.edu Co-investigator 4 Name: Christopher J. Campisano Institution: Institute of Human Origins, Arizona State University, Tempe, AZ 85287, USA Email: campisano@asu.edu Co-investigator 5 Name: Keegan R. Selig Institution: New York Consortium in Evolutionary Primatology, New York, NY, USA Email: kselig@amnh.org Co-investigator 6 Name: Rajeev Patnaik Institution: Department of Geology, Panjab University, Chandigarh-160014, India Email: rajeevpatnaik@gmail.com Co-investigator Name: Ningthoujam Premjit Singh Institution: Wadia Institute of Himalayan Geology, Dehradun-248001, India Email: ningthoujampremjit11@gmail.com 3. Date of data collection: 2019-2021 4. Geographic location of data collection: India 5. Information about funding sources that supported the collection of the data: Wadia Institute of Himalayan Geology, Dehradun, India; the Leakey Foundation, the PSC-CUNY faculty award program, Hunter College; and the U.S. National Science Foundation DATA & FILE OVERVIEW 1. Description of dataset This dataset were generated to represent a new genus and species of fossil treeshrew, specimens of the hedgehog Galerix, and other micromammals from the middle Miocene (Lower Siwalik) deposits surrounding Ramnagar (Udhampur District, Jammu & Kashmir), at a fossil locality known as Dehari. The treeshrew from Dehari (Sivatupaia ramnagarensis gen. nov. et sp. nov.) currently represents the oldest record of fossil tupaiids in the Siwaliks, extending their time range by ~2.5-4.0 million years in the region. Dietary analyses suggest that the new tupaiid was likely adapted for a less mechanically challenging or more frugivorous diet compared to other extant and fossil tupaiids. The occurrence of Galerix has only been recently documented from the Indian Siwaliks and the Dehari specimens help establish the likely presence of a relatively large Siwalik Galerix species in the Ramnagar region. In addition to the new treeshrew and hedgehogs, new specimens of the rodents Kanisamys indicus, Sayimys sivalensis, and Murinae indet. from Dehari help confirm age estimates for the Ramnagar region equivalent to the Chinji Formation in Pakistan, most likely corresponding to the middle to upper part of the Chinji Formation. 2. File List: File 1 name: SI Dataset 1 File 1 Description: Generalized Procrustes Analysis data of treeshrews. File 2 Name: SI Figure S1 File 2 Description: Plot of M2 shape vs. M2 Size in G. rutlandae (Black dots), G. wesselsae (Blue squares), Galerix sp. indet. from Potwar locality Y709 (purple cross), and Galerix sp. indet. from Kulwanta locality K2, Ramnagar, India (Red star). Note that the Ramnagar specimen is larger than G. rutlandae despite exhibiting similar occlusal features. File 3 name: SI Table S1 File 3 Description: List of specimens included in the analyses and results of the topographic analysis for Dirichlet normal energy (DNE), three-dimensional orientation patch count rotated (3D-OPCR), and relief index (RFI). MorphoSource/ ARCTOS information is given when available. Where specimens are listed as not being used in both analyses, they were only used in the DTA analysis with the exception of the single specimens of Tupaia picta, Tupaia montana, and Prodendrogale yunnanica, which were only included in the GMA. File 4 Name: SI Table S2 File 4 Description: Comparative dental measurements in mm of Galerix rutlandae and Galerix wesselsae M2s from Pakistan and India. File 5 Name: SI Table S3 File 5 Description: Comparative dental measurements (mm) of WIMF/A 4696, WIMF/A 4693, and WIMF/A4692 and previously described Lower Siwalik fossil murines. METHODOLOGICAL INFORMATION The micromammal teeth were recovered by macerating approximately 200 kilograms of sediments from Dehari in the Biostratigraphy Lab at the Wadia Institute of Himalayan Geology (WIHG), Dehradun (India), originally collected by R. K. Sehgal during the 2017 and 2019 field seasons. The sediments were broken into fragments and then soaked in plastic tubs with buffered acetic acid and water. The loose material was then wet sieved through 20, 40 and 60 mesh sieves (ISTM). The material collected in the sieves was dried in the sun and microvertebrates were sorted using a fine brush under a binocular microscope housed at WIHG. The specimens described in this paper are housed in the WIHG, and bear the acronym WIMF/A (Wadia Institute Micro Fossil Series A). In order to facilitate the study of these small micromammal teeth, three-dimensional (3D) imaging was obtained using high-resolution micro-CT (µCT), housed in the Molecular Imaging Center of the Keck School of Medicine of the University of Southern California (Los Angeles, CA, USA). Each fossil tooth was scanned individually within a plastic sample holder (i.e., centrifuge tube), with the specimen held securely in place using foam and soft cotton to prevent movement artefacts during the scan. Scans were obtained with a GE phoenix nanotom m system (GE Inspection Technologies, Lewistown, PA, USA) with the following parameters: voltage = 120; current = 70; filter = 2.5 mm Al + Al 0.5 mm; averaging = 2; magnification = 27.778-33.335; isometric voxel dimensions = 0.00299-0.00359 mm. 3D surface renderings of each specimen were created in Amira 3D v.2021.1 software (Thermo Fisher Scientific, Inc., Waltham, MA, USA) from image stacks of 16-bit unsigned DICOM images after undergoing a median (2 pixel) filter. These 3D surfaces as shown in Figs. 2, 7 and 8 are available to download from MorphoSource (www.morphosource.org) as part of the project “Siwalik Fossils from Ramnagar (Jammu & Kashmir), India”. Following the methods of Selig et al. (2019a, 2019b, 2020), 3D geometric morphometric (3DGM) analyses were conducted of the new treeshrew specimen (WIMF/A 4699) in relation to a large sample of extant and fossil treeshrew m2s, where available. The same 18 landmarks taken by Selig et al. (2020) were collected on the WIMF/A 4699 in the Avizo v.8.1.1 software (Thermo Fisher Scientific, Inc., Waltham, MA) using the landmark editor function, and then the landmark data for this specimen were added to the accessible sample (n=46) from the Selig et al. (2020) dataset (i.e., two specimens, Ptilocercus lowii YPM MAM 10179 and Dendrogale murina UAM: Mamm 103000 had to be excluded; see Supporting Information [SI] Table 1 and SI Dataset 1 for list of included specimens). A Generalized Procrustes Analysis (GPA) was performed to scale, rotate, and translate the landmark data, followed by a Principal Components Analysis (PCA) in the software package Morphologika2 (O’Higgins and Jones, 2006) using wireframes to visualize differences in shape along the resulting PC axes. For visualization, the Procrustes coordinates were also submitted to a PCA in the software package PAST v.4.03 (Hammer et al., 2001) along with a UPGMA cluster analysis based on the generic-level averages of the first five PC scores among the included taxa to assess phenetic affinities. In order to make inferences of dietary adaptations about the fossil treeshrew, Dirichlet normal energy (DNE), 3D orientation patch count rotated (3D-OPCR), and relief index (RFI) were measured in WIMF/A 4699 relative to a large sample of other extant and fossil treeshrews previously measured by Selig et al. (2020) (see SI Table 1). These methods are among a suite of dental topographic metrics that quantify functional aspects of the occlusal surface of teeth that can be tied directly to dietary adaptations (Ungar and M’Kirera, 2003; Evans et al., 2007; Boyer, 2008; Bunn et al., 2011, Winchester, 2016). For example, DNE quantifies occlusal curvature, so teeth with sharper crests and cusps have higher DNE values. High DNE values relate to more mechanically challenging diets, such as insects, whereas lower values relate to the processing of soft foods such as fruit (Bunn et al., 2011; Winchester, 2016). 3D-OPCR is a measure of surface complexity, so teeth with more crests, cusps, and ridges have higher values. High 3D-OPCR values also relate to insectivory, whereas low values relate to the consumption of softer foods such as fruits (Evans et al., 2007; Winchester, 2016). Finally, RFI is a measure of relative crown height and teeth with relatively taller cusps or relatively taller teeth overall have higher RFI values. High RFI values relate to the consumption of insects, whereas low values relate to the consumption of softer foods (Ungar and M’Kirera, 2003; Boyer, 2008). Prior to analysing WIMF/A 4699 for topographic analyses, the 3D surface was simplified to 10,000 faces and smoothed to 100 iterations with a lambda at 0.6 following the protocol of Selig et al. (2019b, 2020). All three topographic metrics were measured using MorphoTester v.1.1.1 software (Winchester, 2016) using the default settings for DNE and the patch count set at 5 for 3D-OPCR. A second PCA was performed on the covariance matrix including the species means of these three topographic metrics, thus allowing the visualization of overall variation in dental topography across species in our sample. All 3DGM and dental topographic data were collected by a single observer (KRS). See Selig et al. (2020) for additional methodological details. The dental terminologies used here follow Jacobs (1978) for murids; Ziegler (1990) for erinaceids; Jacobs (1978), Flynn (1982a) and López-Antoñanzas et al. (2013) for rhizomyines; Baskin (1996) and López-Antoñanzas and Knoll (2011) for ctenodactylines; and Jacobs (1980) for tupaiids. DATA-SPECIFIC INFORMATION FOR: SI Dataset 1 1. Number of variables: None 2. Number of cases/rows: None 3. Variable List: None 4. Missing data codes: None 5. Specialized formats or other abbreviations used: DATA-SPECIFIC INFORMATION FOR: SI Figure S1 1. Number of variables: None 2. Number of cases/rows: None 3. Variable List: None 4. Missing data codes: None 5. Specialized formats or other abbreviations used: DATA-SPECIFIC INFORMATION FOR: SI Table S1 1. Number of variables: 7 2. Number of cases/rows: 64 3. Variable List: Specimen ID: Specimen number/catalogue numbers Species: Taxon name Both Analyses?: Yes/No DNE: Dirichlet normal energy 3D-OPCR: 3D orientation patch count rotated RFI: relief index DOI/ ARK/ ARCTOS: Source 4. Missing data codes: Not applicable 5. Specialized formats or other abbreviations used: Not applicable DATA-SPECIFIC INFORMATION FOR: SI Table S2 1. Number of variables: 8 2. Number of cases/rows: 86 3. Variable List: Taxon: Name of the taxa Specimen: Specimen ID/Catalogue numbers Locality: Occurrence of the specimens Length: Mesio-distal length of the tooth in millimeter Width: Bucco-lingual lenth of the tooth in millimeter Square root of Area(Size): Represents size of the tooth Max Width/Max Length (Shape): Represents shape of the tooth Age (Ma): Age of the locality where the specimen occurred 4. Missing data codes: Not applicable 5. Specialized formats or other abbreviations used: Not applicable DATA-SPECIFIC INFORMATION FOR: SI Table S3 1. Number of variables: 9 2. Number of cases/rows: 99 3. Variable List: Taxon: Name of the taxa Elements: Specimen ID/Catalogue numbers Specimen numbers: Locality: Occurrence of the specimens Length: Mesio-distal length of the tooth in millimeter Width: Bucco-lingual lenth of the tooth in millimeter Max Width/Max Length (Shape): Represents shape of the tooth Square root of Area(Size): Represents size of the tooth Age (Ma): Age of the locality where the specimen occurred 4. Missing data codes: Not applicable 5. Specialized formats or other abbreviations used: Not applicable