Skip to main content
Dryad

Taxonomic reassessment of genomes from a divergent population of Streptococcus suis by average nucleotide identity analysis

Data files

May 20, 2025 version files 1.60 MB

Click names to download individual files

Abstract

This dataset provides genomic-level re-identification and taxonomic reassessment of Streptococcus suis out-population strains based on average nucleotide identity (ANI), 16S rRNA gene comparisons, and other phylogenetic markers. The dataset includes an Excel workbook with eight sheets (Sheet A–H), which together constitute Supplementary Table S1. This table contains detailed information on the studied strains, including their disease association, sequence alignment results (identity, coverage, and length) of 16S rRNA genes against both the S. suis type strain S735 and a database of prokaryotic type strains, ANI and AAI values compared to S735, accession numbers for short-read and long-read data, genome assembly IDs, phylogenetic lineage within the S. suis tree, group classification within the out-population, and cluster medoid status indicating representative strains. Through comparative genomic analysis, this study established an ANI threshold of 93.17% for defining authentic S. suis, revealing that all 645 genomes from the out-population fell below this cutoff and thus did not belong to S. suis. Further pairwise ANI comparison identified 18 distinct clusters among these strains, leading to the proposal of 12 novel Streptococcus species and the reclassification of six as known species. This dataset serves as a valuable resource for researchers studying streptococcal taxonomy, zoonotic pathogens, and bacterial evolution. It enables further investigations into genomic diversity, species delineation, and the evolutionary relationships within the genus Streptococcus.