Skip to main content
Dryad

Genome assemblies for halophilic bacteria with potential contamination, sampled from Northern California in 2022

Data files

Nov 29, 2024 version files 22.07 MB

Abstract

BIS 23 is an undergraduate research course offered at UC Davis, designed to teach new undergraduates the fundamentals of good science and academic research. The course is structured around the collection and sequencing of halophilic organisms (e.g. salt-tolerant bacteria) from samples taken by students themselves. In the 2022/2023 iteration of the course, students took dozens of environmental samples from Northern California, primarily in Davis, CA. These samples were cultured, incubated, and the most successful ones were sent for sequencing. Nine genome scaffolds were successfully created after sequencining on the Illumina MiSeq platform and processing with Trimmomatic 0.36 and SPAdes 3.7. Only 6 of the 9 scaffolds were accepted by the NCBI Genome database, due to suspected contamination in the remaining 3. This dataset provides the FASTA files for the 3 assemblies with suspected contamination. Researchers can freely access and use this dataset for any purpose, such as applying more advanced contamination detection and removal algorithms that were beyond the scope of the undergraduate course.