Skip to main content
Dryad

PhyloSift Markers Database

Cite this dataset

Jospin, Guillaume (2018). PhyloSift Markers Database [Dataset]. Dryad. https://doi.org/10.25338/B8F30J

Abstract

Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection.

In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata.

These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454).

Usage notes

This marker database is used by the PhyloSift pipeline. It looks for its default location in the following directory "/home/username/share/phylosift".
Extract the archive manually in this location to start using the pipeline normally.

The marker location can be modified by changing the path in the phylosiftrc file.