Reference genome and annotation for Teleopsis dalmanni

Harney, Ewan¹; Jansen van Rensburg, Alexandra²; Alston, Ben¹; Price, Peter¹; Bates, Sadé³; Tucker, Rachel¹; Stagg, Jennifer¹; Hipperson, Helen¹; Wright, Alison¹; Burke, Terry¹; Pomiankowski, Andrew⁴

Published Aug 28, 2025 on Dryad. https://doi.org/10.5061/dryad.j6q573nqw

Data files

Aug 28, 2025 version files (35.11 GB total)

m64157e_210730_141553.hifi_reads.bam         8.41 GB
m64157e_210730_141553.hifi_reads.fasta.gz    4.43 GB
m64157e_210730_141553.hifi_reads.fastq.gz    8.90 GB
m64157e_211024_013127.hifi_reads.bam         4.93 GB
m64157e_211024_013127.hifi_reads.fasta.gz    2.74 GB
m64157e_211024_013127.hifi_reads.fastq.gz    5.24 GB
README.md                                    2.24 KB
ST_FINAL.fa                                  401.38 MB
ST_FINAL.gff                                 51.53 MB

Abstract

This dataset provides a reference genome assembly and sequencing data for Teleopsis dalmanni (stalk-eyed fly). T. dalmanni is an important model organism for the study of sexual selection, sexual conflict, and selfish genetic elements; however, only recently have high-quality genomes become readily available for understanding the genetic basis of these processes. Here, we present a whole genome assembly (three chromosomes with GFF annotation). The assembly was generated from PacBio HiFi reads from two sequencing runs (provided in BAM, FASTQ, and FASTA formats), and Iso-Seq transcript data were used for annotation. The genome was assembled with HiFiasm, haplotigs were removed with purge_dups, and scaffolds were generated using publicly available chromatin conformation capture data. These data also showcase the novel annotation method OMAnnotator, which uses the OMA algorithm to exploit the evolutionary relationships among genes across species.
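
As a quick sanity check after download, the assembly and annotation files can be summarised with a few lines of Python. The sketch below is illustrative only and is not part of the dataset or the authors' pipeline. It uses only the standard library, and the function names `fasta_lengths` and `gff_feature_counts` are hypothetical helpers introduced here. It assumes ST_FINAL.fa is a standard FASTA file and ST_FINAL.gff is a tab-delimited GFF file with nine columns. The abstract describes three chromosomes; whether the FASTA file also contains other sequences is not stated here, so the code makes no assumption about sequence names or counts.

```python
import gzip
from collections import Counter


def open_text(path):
    """Open a plain or gzip-compressed text file for reading."""
    path = str(path)
    return gzip.open(path, "rt") if path.endswith(".gz") else open(path, "r")


def fasta_lengths(path):
    """Return {sequence_name: length} for every record in a FASTA file."""
    lengths = {}
    name = None
    with open_text(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Use the first whitespace-delimited token as the name.
                tokens = line[1:].split()
                name = tokens[0] if tokens else ""
                lengths[name] = 0
            elif name is not None:
                lengths[name] += len(line)
    return lengths


def gff_feature_counts(path):
    """Count GFF features by type (column 3), skipping comments and blanks."""
    counts = Counter()
    with open_text(path) as handle:
        for line in handle:
            if not line.strip() or line.startswith("#"):
                continue
            fields = line.rstrip("\n").split("\t")
            # Assumption: well-formed feature lines have nine columns.
            if len(fields) >= 9:
                counts[fields[2]] += 1
    return counts
```

With the files downloaded into the working directory, `fasta_lengths("ST_FINAL.fa")` returns the length of each sequence in the assembly and `gff_feature_counts("ST_FINAL.gff")` returns the number of annotated features of each type. The same `fasta_lengths` helper also reads the gzip-compressed `.fasta.gz` read files, although holding one dictionary entry per read may use a lot of memory for files of this size.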

README: Reference genome and sequencing data for Teleopsis dalmanni

Description of the data and file structure