Data from: High quality whole genome sequence of an abundant Holarctic odontocete, the harbour porpoise (Phocoena phocoena)
Data files
Jul 11, 2018 version files 67.50 MB
Abstract
The harbour porpoise (Phocoena phocoena) is a highly mobile cetacean found across the Northern hemisphere. It occurs in coastal waters and inhabits basins that vary broadly in salinity, temperature, and food availability. These diverse habitats could drive subtle differentiation among populations, but examination of this would be best conducted with a robust reference genome. Here we report the first harbour porpoise genome, assembled de novo from an individual originating in the Kattegat Sea (Sweden). The genome is one of the most complete cetacean genomes currently available, with a total size of 2.39 Gb and 50% of the total length found in just 34 scaffolds. Using just 122 of the longest scaffolds, we were able to show high levels of synteny with the genome of the domestic cattle (Bos taurus). Our draft annotation comprises 22,154 predicted genes, which we further annotated through matches to the NCBI nucleotide database, GO categorization, and motif prediction. Within the predicted genes we have confirmed the presence of >20 genes or gene families that have been associated with adaptive evolution in other cetaceans. Overall, this genome assembly and draft annotation represent a crucial addition to the genomic resources currently available for the study of porpoises and Phocoenidae evolution, phylogeny, and conservation.