Genome and transcriptome-based identification and expression profiling of chemosensory gene families across developmental stages and tissues in Sirex noctilio (Hymenoptera: Siricidae)
Data files
Jan 07, 2026 version files 238.49 MB
Abstract
The Sirex woodwasp (Sirex noctilio; Hymenoptera: Siricidae) is among the most destructive invasive pests affecting Pinus plantations worldwide. Chemosensory systems offer promising targets for pest control strategies, and the identification and characterization of chemosensation genes in non-model, economically significant insects such as S. noctilio is an important first step towards the development of such control methods. Here we sequenced and assembled a draft genome of S. noctilio and performed RNA-sequencing of 15 olfactory and non-olfactory tissues to study the expression patterns of chemosensation-related genes. Specific genes, such as SnocOR16 and SnocSNMP1, displayed tissue- and sex-specific expression patterns, making them particularly intriguing for their potential roles in chemosensation and oviposition. As woodwasps and their related lineages form a sister group to the majority of other Hymenoptera, including Apocrita, insights into their gene repertoires are crucial for tracing the evolutionary history of chemosensory multigene families of this ecologically and economically significant insect order. This study enhances our understanding of the molecular mechanisms underlying S. noctilio chemosensation, paving the way for further research in chemical ecology and the functional characterization of S. noctilio chemosensation genes.
Dataset DOI: 10.5061/dryad.tdz08kqbp
Description of the data and file structure
Principle Investigator Contact Information
Name: Alisa Postma
Institution: Forestry and Agricultural Biotechnology Institute, University of Pretoria
Email: alisa.postma@fabi.up.ac.za
Alternate Contact Information
Name: Bernard Slippers
Institution: Forestry and Agricultural Biotechnology Institute, University of Pretoria
Email: bernard.slippers@fabi.up.ac.za
Dataset Overview
This dataset contains the genome assembly data generated and used in the manuscript, Genome and transcriptome-based identification and expression profiling of chemosensory gene families across developmental stages and tissues in Sirex noctilio (Hymenoptera: Siricidae), currently under review. Here we sequenced and assembled a draft genome of S. noctilio. The assembled genome consisted of 6215 contigs with a total size of 185 Mbp and an N50 length of 825 kbp. The assembly has a BUSCO completeness score of 95.4%. Annotation of the genome assembly resulted in 20,629 predicted genes. The first reported draft genome assembly of S. noctilio was produced in this study.
Description of the data and file structure
The contigs making up the assembled genome and predicted proteins are in a fasta file, which is a plain-text file used in Bioinformatics to store nucleotide or protein sequences. The format consists of a header line starting with > followed by lines of the sequence.
Files and variables
File: Sirex_assembly_May2017_nopipe_nosize.fasta
Description: Genome assembly in FASTA format consisting of long DNA scaffold sequences.
File: Sirex.maker.annotation2017.curated.final2.gtf
Description: Genome annotation file in gtf format. This file maps where genes, exons, and protein-coding regions are located along scaffold1, indicating which strand they are on, how exons are connected, and which prediction method supported each gene model.
File: Sirex_assembly_May2017.all.maker.proteins.SNOC.notrailing.fasta
Description: Predicted protein sequences from annotation of the Sirex genome assembly in fasta format. The wide variation in length and composition reflects different protein types, including secreted peptides, transcription factors (rich in Lys/Arg), enzymes, membrane proteins, and repeat-rich regulatory proteins.
Code/software
Genome assembly, annotation and protein sequence data can be viewed as plain text files.
