Skip to main content
Dryad

Data from: Discovery and characterization of single nucleotide polymorphisms in Chinook salmon, Oncorhynchus tshawytscha

Data files

Dec 17, 2010 version files 202.80 KB

Click names to download individual files

Abstract

Molecular population genetics of non-model organisms has been dominated by the use of microsatellite loci over the last two decades. The availability of extensive genomic resources for many species is contributing to a transition to the use of single nucleotide polymorphisms (SNPs) for the study of many natural populations. Here we describe the discovery of a large number of SNPs in Chinook salmon, one of the world’s most important fishery species, through large-scale Sanger sequencing of expressed sequence tag (EST) regions. More than 3MB of sequence was collected in a survey of variation in more than 131KB of unique genic regions, from more than 225 separate ESTs, in a diverse ascertainment panel of 24 salmon. This survey yielded 117 TaqMan (5’ nuclease) assays, almost all from separate EST regions, which were validated in population samples from 5 major stocks of salmon from the three largest basins on the Pacific coast of the coterminous United States: the Sacramento, Klamath and Columbia Rivers. The proportion of these loci that was variable in each of these stocks ranged from 86.3 to 90.6% and the mean minor allele frequency ranged from 0.194 to 0.236. There was substantial differentiation between populations with these markers, with a mean FST estimate of 0.107, and values for individual loci ranging from 0 to 0.592. This substantial polymorphism and population-specific differentiation indicates that these markers will be broadly useful, including for both pedigree reconstruction and genetic stock identification applications.