Skip to main content
Dryad logo

Data from: phenotools: an R package for visualizing and analyzing phenomic datasets

Citation

Eliason, Chad M.; Edwards, Scott V.; Clarke, Julia A. (2019), Data from: phenotools: an R package for visualizing and analyzing phenomic datasets, Dryad, Dataset, https://doi.org/10.5061/dryad.05qm36k

Abstract

1.Phenotypic data is crucial for understanding genotype–phenotype relationships, assessing the tree of life, and revealing trends in trait diversity over time. Large‐scale description of whole organisms for quantitative analyses (phenomics) presents several challenges, and technological advances in the collection of genomic data outpace those for phenomic data. Reasons for this disparity include the time‐consuming and expensive nature of collecting discrete phenotypic data and mining previously‐published data on a given species (both often requiring anatomical expertise across taxa), and computational challenges involved with analyzing high‐dimensional datasets. 2.One approach to building approximations of organismal phenomes is to combine published datasets of discrete characters assembled for phylogenetic analyses into a phenomic dataset. Despite a wealth of legacy datasets in the literature for many groups, relatively few methods exist for automating the assembly, analysis, and visualization of phenomic datasets in phylogenetic contexts. Here, we introduce a new R package phenotools for integrating (fusing original or legacy datasets), curating (finding and removing duplicates), and visualizing phenomic datasets. 3.We demonstrate the utility of the proposed toolkit with a morphological dataset for flightless birds and two morphological datasets for theropod dinosaurs and provide recommendations for character construction to maximize accessibility in future workflows. Visualization tools allow rapid identification of anatomical sub‐regions with difficult or problematic histories of homology. 4.We anticipate these tools aiding automation of the assembly and visualization of phenomic datasets to inform evolutionary relationships and rates of phenotypic evolution.

Usage Notes

Funding

National Science Foundation, Award: EAR-1355292