Skip to main content
Dryad

Reference genome of an irruptive migrant, the pine siskin (Spinus pinus)

Data files

Oct 18, 2025 version files 2.40 GB

Click names to download individual files

Abstract

This dataset contains unscaffolded and supporting files associated with the chromosome-level genome assembly of the Pine Siskin (Spinus pinus). We provide two unscaffolded haplotype assemblies (pisiContigs_hap1.fasta and pisiContigs_hap2.fasta) that correspond to the scaffolded assemblies deposited in NCBI under BioProject accession numbers [PRJNA1281197] (primary haplotype) and [PRJNA1281196] (alternate haplotype). To document the scaffolding process, we include AGP files generated by RagTag (ragtag.scaffold.Hap1.agp and ragtag.scaffold.Hap2.agp), which specify the ordering and orientation of contigs into pseudochromosomes.

To support repeat annotation and comparative analyses, we also provide both raw and curated repeat libraries. The raw library (pisiRepeats-families.fa) contains de novo repeat family predictions. Curated libraries were refined using MCHelper and are provided in redundant (curated_sequences_R.fa) and nonredundant (curated_sequences_NR.fa) formats.