Skip to main content
Dryad

Data from: The effects of read length, quality and quantity on microsatellite discovery and primer development: from Illumina to PacBio

Cite this dataset

Wei, Na; Bemmels, Jordan B.; Dick, Christopher W. (2014). Data from: The effects of read length, quality and quantity on microsatellite discovery and primer development: from Illumina to PacBio [Dataset]. Dryad. https://doi.org/10.5061/dryad.r20t0

Abstract

The advent of next-generation sequencing (NGS) technologies has transformed the way microsatellites are isolated for ecological and evolutionary investigations. Recent attempts to employ NGS for microsatellite discovery have used the 454, Illumina, and Ion Torrent platforms, but other methods including single-molecule real-time DNA sequencing (Pacific Biosciences, or PacBio) remain viable alternatives. We outline a workflow from sequence quality control to microsatellite marker validation in three plant species using PacBio circular consensus sequencing (CCS). We then evaluate the performance of PacBio CCS in comparison to other NGS platforms for microsatellite isolation, through simulations that focus on variations in read lengths, read quantity, and sequencing error rates. Although quality control of CCS reads reduced microsatellite yield by around 50%, hundreds of microsatellite loci that are expected to have improved conversion efficiency to functional markers were retrieved for each species. The simulations quantitatively validate the advantages of long reads, and emphasize the detrimental effects of sequencing errors on NGS-enabled microsatellite development. In view of the continuing improvement in read lengths on NGS platforms, sequence quality and the corresponding strategies of quality control will become the primary factors to consider for effective microsatellite isolation. Among current options, PacBio CCS may be optimal for rapid, small-scale microsatellite development due to its flexibility in scaling sequencing effort, while platforms such as Illumina MiSeq will provide cost-efficient solutions for multi-species microsatellite projects.

Usage notes

Location

Barro Colorado Island
Panama