Data from: Comprehensive identification and clustering of CLV3/ESR-related (CLE) genes in plants finds groups with potentially shared function
Goad, David M.; Zhu, Chuanmei; Kellogg, Elizabeth A. (2017), Data from: Comprehensive identification and clustering of CLV3/ESR-related (CLE) genes in plants finds groups with potentially shared function, Dryad, Dataset, https://doi.org/10.5061/dryad.dr714
CLV3/ESR (CLE) proteins are important signaling peptides in plants. The short CLE peptide (12–13 amino acids) is cleaved from a larger pre-propeptide and functions as an extracellular ligand. The CLE family is large and has resisted attempts at classification because the CLE domain is too short for reliable phylogenetic analysis and the pre-propeptide is too variable.
We used a model-based search for CLE domains from 57 plant genomes and used the entire pre-propeptide for comprehensive clustering analysis.
In total, 1628 CLE genes were identified in land plants, with none recognizable from green algae. These CLEs form 12 groups within which CLE domains are largely conserved and pre-propeptides can be aligned. Most clusters contain sequences from monocots, eudicots and Amborella trichopoda, with sequences from Picea abies, Selaginella moellendorffii and Physcomitrella patens scattered in some clusters. We easily identified previously known clusters involved in vascular differentiation and nodulation. In addition, we found a number of discrete groups whose function remains poorly characterized. Available data indicate that CLE proteins within a cluster are likely to share function, whereas those from different clusters play at least partially different roles.
Our analysis provides a foundation for future evolutionary and functional studies.
National Science Foundation, Award: IOS-1413824