Data from: Using phylogenomics to resolve mega-families: an example from Compositae

Mandel, Jennifer R.1; Dikow, Rebecca B.2; Funk, Vicki A.3

Published May 15, 2016 on Dryad. https://doi.org/10.5061/dryad.k9k23

Data files

May 15, 2016 version files 12.93 MB

100_random_clusters_for_network.tre

17.76 KB
astral_conservative.tre

331 B
chloroplast_all_final.fasta

6.25 MB
chloroplast.tre

3.03 KB
conservative_COS_concatenated.fasta

6.31 MB
conservative_COS_concatenated.tre

1.46 KB
Gitools_heatmap_all_clusters.csv

230.71 KB
Gitools_heatmap_conservative_COS.csv

50.50 KB
low_copy_cluster_trees.zip

70.69 KB

Abstract

Next-generation sequencing and phylogenomics hold great promise for elucidating complex relationships among large plant families. Here we performed targeted capture of low copy sequences followed by next-generation sequencing on the Illumina platform in the large and diverse angiosperm family Compositae (Asteraceae). The family is monophyletic based on morphology and molecular data, yet many areas of the phylogeny have unresolved polytomies and interpreting phylogenetic patterns has been historically difficult. In order to outline a method and provide a framework and for future phylogenetic studies in the Compositae, we sequenced 23 taxa from across the family in which the relationships were well established as well as a member of the sister family Calyceraceae. We generated nuclear data from 795 loci and assembled chloroplast genomes from off-target capture reads enabling the comparison of nuclear and chloroplast genomes for phylogenetic analyses. We also analyzed multi-copy nuclear genes in our data set using a clustering method during orthology detection, and we applied a network approach to these clusters—analyzing all related locus copies. Using these data we produced hypotheses of phylogenetic relationships employing both a conservative (restricted to only loci with one copy per targeted locus) and a multigene approach (including all copies per targeted locus). The methods and bioinformatics workflow presented here provide a solid foundation for future work aimed at understanding gene family evolution in the Compositae as well as providing a model for phylogenomic analyses in other plant mega-families.

Data from: Using phylogenomics to resolve mega-families: an example from Compositae

Data files

Abstract

conservative_COS_concatenated_tree

conservative_COS_concatenated_fasta

astral_conservative_COS_tree

chloroplast_all_tree

low_copy_cluster_trees_used_for_network

100_random_cluster_trees_for_network

chloroplast_all_final

Gitools_heatmap_all_clusters

Gitools_heatmap_conservative_COS

Data from: Using phylogenomics to resolve mega-families: an example from Compositae

Data files

Abstract

Usage notes

conservative_COS_concatenated_tree

conservative_COS_concatenated_fasta

astral_conservative_COS_tree

chloroplast_all_tree

low_copy_cluster_trees_used_for_network

100_random_cluster_trees_for_network

chloroplast_all_final

Gitools_heatmap_all_clusters

Gitools_heatmap_conservative_COS

Works referencing this dataset