Bulk RNA-seq and snRNA-seq data from the Middle Temporal Gyrus of individuals with Alzheimer's disease and healthy controls
Data files
Jun 03, 2026 version files 338.96 MB
-
featureCounts_cleaned_bulk_nuclei.txt
2.55 MB
-
featureCounts_cleaned_bulk_tissue.txt
2.33 MB
-
filtered_feature_bc_matrix_snRNA.h5
334.08 MB
-
README.md
1.49 KB
Abstract
The molecular signatures of neurological disorders arise from the specific contribution of many cell types. Cell type-specific dissection of such signatures is critical to characterizing complex underlying mechanisms driving such disorders and to efficiently deriving therapeutic targets. Although single-nucleus technologies have led to key findings in mechanisms of brain diseases, the limited nuclear transcriptional coverage coupled with the high cost and advanced technical skills required have hindered their broad applicability and potential to unveil robust cell type-specific candidates. Here we develop and validate an AI-based framework that enables low-cost and large-scale cell type-specific investigation of comprehensive transcriptional programs from bulk RNA-seq, significantly outperforming previous methods. Our approach leverages the ability of the transformer model to digest a large volume of data and learn a generalizable mapping, thereby restoring cell type-specific RNA profile from bulk expression. Notably, it allows the investigation of cell type-specific transcriptomic programs in complex and heterogeneous phenotypes such as cognitive resilience or brain resistance to AD. In multiple regions of the cerebral cortex, we identify astrocytes as the major cell mediator of resilience. In contrast, excitatory neurons and oligodendrocyte progenitor cells are identified as the major cell mediators of resistance, with putative molecular candidates maintaining synaptic function and preserving neuron health. Finally, using an optimally preserved mouse brain, we show the potential of our approach to restore the whole tissue cell-type specific transcriptome, rather than single nucleus only, offering a framework for an unbiased investigation of whole brain cell transcriptome.
Sample overview
Frozen middle temporal gyrus tissue was placed on a dry ice block, and a 4mm punch was used to excise a piece of grey matter from the same reference point for each sample. The sample was then split into two portions - one for single/bulk nuclei RNA sequencing and the other for bulk tissue RNA sequencing - and shipped to Seqmatic LLC (Fremont, CA, US). At Seqmatic, nuclei were isolated according to the Chromium Nuclei Isolation Reagent Kits Sample Prep User Guide (CG000505) and captured using 10X Genomics Controller device (Pleasanton, CA, US Libraries were prepared using the 10x Genomics Chromium Single Cell 3’ v3 Reagent Kit and sequenced on NovaSeq X Series (Illumina, San Diego, CA, US).
Description of the data and file structure
Gene counts were obtained by aligning reads to the GRCh38-2020-A genome using Cell Ranger software v7.0.1 (10x Genomics) and was performed by Seqmatic company.
filtered_feature_bc_matrix_snRNA.h5: raw snRNA gene matrix in AnnData format.
featureCounts_cleaned_bulk_tissue.txt: raw bulk RNA expression matrix from bulk tissue
featureCounts_cleaned_bulk_nuclei.txt: raw bulk RNA expression matrix from bulk nuclei
Human subjects data
Primary brain samples were obtained at the University of Washington, from post-mortem tissue following informed consent and Institutional Review Boards approval. Patient-level metadata are available from the corresponding author upon reasonable request.
