Single-cell multiomics of neuronal activation reveals context-dependent genetic control of brain disorders
Data files
Mar 02, 2025 version files 318.36 MB
-
All_other_supplementary_Tables_(1-3__5__7__8_10_11__15__16-18_22_26-29_31_32_33).xlsx
28.74 MB
-
README.md
27.42 KB
-
Table_S12_ABC_EnhancerPredictions_threshold0.021_self_promoter.xlsx
62.06 MB
-
Table_S13._Differential_TF_motif_activity_analysis_using_a_one-tailed_Wilcoxon_signed-rank_test.xlsx
93.69 KB
-
Table_S14._Gene_regulatory_networks_identified_in_three_cell_types.xlsx
3.50 MB
-
Table_S19_Signficant_eQTLs_in_each_cell_type_and_time_point.xlsx
64.12 MB
-
Table_S20._Dynamic_cGenes_in_each_cell_type_and_time_point.xlsx
877.12 KB
-
Table_S21._eQTL-based_cTWAS_results.xlsx
70.51 KB
-
Table_S23._cPeak_and_caQTL_identified_by_tensorQTL_in_each_cell_type_and_time_point.xlsx
8.14 MB
-
Table_S24._ASoC_SNPs_in_each_cell_type_and_time_point.xlsx
113.96 MB
-
Table_S25._Dynamic_caQTLs_for_each_cell_type_with_permutation_(n_1000)_testing_result.xlsx
1.57 MB
-
Table_S30._caQTL-based_cTWAS_results.xlsx
82.04 KB
-
Table_S6._Differentially_accessible_peaks_upon_KCI_stimulation_table.xlsx
31.93 MB
-
Table_S9_OCR-gene_pairs_identified_through_single-cell_level_peak-gene_correlation_(co-activation)_analysis.xlsx
3.18 MB
Abstract
Despite hundreds of genetic risk loci identified for neuropsychiatric disorders (NPD), most causal variants/genes remain unknown. A major hurdle is that disease risk variants may act in specific biological contexts, e.g., during neuronal activation, which is difficult to study in vivo at the population level. Here, we modeled neuronal activation in human iPSC-induced excitatory and inhibitory neurons from 100 donors. Single-cell multiomics analyses of over a million neurons uncovered complex activity-dependent transcriptomic and epigenomic regulation and significantly expanded the repertoire of stimulation-specific causal variants/genes for NPD. We identified thousands of genetic variants associated with activity-dependent gene expression (i.e., eQTL) and chromatin accessibility (i.e., caQTL). These caQTL explained considerably larger proportions of NPD heritability than the eQTL. Integrating the multiomic data with GWAS revealed NPD risk variants/genes whose effects were only detected upon stimulation. Interestingly, multiple lines of evidence support a role of activity-dependent cholesterol metabolism in NPD. Our work highlights the power of cell stimulation to reveal context-dependent “hidden” genetic effects.
https://doi.org/10.5061/dryad.0zpc8677w
Description of the data and file structure
Single-cell multiomics of neuronal activation reveals context-dependent genetic control of brain disorders.
Lifan Liang1,†, Siwei Zhang2,5,†, Zicheng Wang1,†, Hanwen Zhang2,†, Chuxuan Li2,3,†, Alexandra C. Duhe2, Xiaotong Sun1, Xiaoyuan Zhong1, Alena Kozlova2, Brendan Jamison1,2, Whitney Wood2, Zhiping P. Pang4, Alan R. Sanders2,5, Xin He1,‡, Jubao Duan2,5,‡
Affiliations:
1Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA.
2Center for Psychiatric Genetics, Endeavor Health Research Institute, Evanston, IL 60201, USA.
3Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
4Department of Neuroscience and Cell Biology, Child Health Institute of New Jersey, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ 08901, USA.
5Department of Psychiatry and Behavioral Neuroscience, The University of Chicago, Chicago, IL 60637, USA.
† These authors contributed equally
‡ Corresponding authors: Xin He (xinhe@uchicago.edu), Jubao Duan (jduan@uchicago.edu)
Files and variables
File: Table_S9_OCR-gene_pairs_identified_through_single-cell_level_peak-gene_correlation_(co-activation)_analysis.xlsx
Description: Table S9. OCR-gene pairs identified through single-cell level peak-gene correlation (co-activation) analysis.
Variables
- Gene: Gene symbol;
- Peak: correlated peak position, chromosome:start-end;
- Beta: calculated coefficient from the linear model;
- SE: standard error calculated from the simple linear model;
- Z: z-score;
- Pvalue: P value;
- FDR: False Discovery Rate. The adjusted p-values corrected for multiple testing (Benjamini-Hochberg).
File: Table_S13._Differential_TF_motif_activity_analysis_using_a_one-tailed_Wilcoxon_signed-rank_test.xlsx
Description: Table S13. Differential TF motif activity analysis using a one-tailed Wilcoxon signed-rank test.
Variables
- TF: transcription factor motif name;
- Pval-test_1hrVS0hr: p-value of the Wilconxon signed-rang test, 1 hr stimulation vs unstimulated;
- Pval-test_6hrVS0hr: p-value of the Wilconxon signed-rang test, 6 hr stimulation vs unstimulated;
- Pval-test_6hrVS1hr: p-value of the Wilconxon signed-rang test, 6 hr stimulation vs 1 hr stimulation;
- FDR-test_1hrVS0hr: BH-adjusted p-value (FDR) of the Wilconxon signed-rang test, 1 hr stimulation vs unstimulated;
- FDR-test_6hrVS0hr: BH-adjusted p-value (FDR) of the Wilconxon signed-rang test, 6 hr stimulation vs unstimulated;
- FDR-test_6hrVS1hr: BH-adjusted p-value (FDR) of the Wilconxon signed-rang test, 6 hr stimulation vs 1 hr stimulation.
File: Table_S12_ABC_EnhancerPredictions_threshold0.021_self_promoter.xlsx
Description: Table S12. Enhancer-gene pairs predicted by the ABC (Activity-by-Contact) model, with an ABC score ≥ 0.021. Predictions from all nine contexts are included.
Variables
- Chr: chromosome of the predicted enhancer;
- Start: start position of the predicted enhancer;
- End: end position of the predicted enhancer;
- Name: Name of the predicted enhancer;
- TargetGene: predicted target gene of the corresponding enhancer region;
- TargetGeneTSS : genomic distance (bp) between the corresponding enhancer region and the TSS site of its target gene;
- Context: time x cell type combination of the corresponding enhancer-gene pairs found;
- ABC.Score: the final score of Activity-by-Contact model.
File: Table_S25.Dynamic_caQTLs_for_each_cell_type_with_permutation(n_1000)_testing_result.xlsx
Description: Table S25. Dynamic caQTLs for each cell type with permutation (n=1000) testing result.
Variables
- phenotype_id: phenotype id (as in cis-mod);
- variant_id: SNP id;
- empirical p_val: raw empirical p-value, as calculated by TensorQTL;
- empirical fdr: Adjusted empirical p-values (q-values), as calculated by the qvalue package.
File: Table_S23._cPeak_and_caQTL_identified_by_tensorQTL_in_each_cell_type_and_time_point.xlsx
Description: Table S23. cPeak and caQTL identified by tensorQTL in each cell type and time point.
Variables
- phenotype_id: phenotype id (as in cis-mod);
- num_var: Number of variants in cis-window;
- beta_shape1: Parameter of the fitted Beta distribution;
- beta_shape2: Parameter of the fitted Beta distribution;
- true_df: Degrees of freedom used to compute p-values;
- pval_true_df: Nominal p-value based on true_df;
- variant_id: SNP id;
- start_distance: Distance between the variant and phenotype start position (e.g., TSS);
- end_distance: Distance between the variant and phenotype end position (only present if different from start position);
- ma_samples: Number of samples carrying at least on minor allele;
- ma_count: Number of minor alleles;
- af: In-sample ALT allele frequency of the variant;
- pval_nominal: Nominal p-value of the association between the phenotype and variant;
- slope: Regression slope;
- slope_se: Standard error of the regression slope;
- pval_perm: Empirical p-value from permutations;
- pval_beta: Beta-approximated empirical p-value;
- qval: Storey q-value corresponding to pval_beta;
- pval_nominal_threshold: Nominal p-value threshold for significant associations with the phenotype.
File: Table_S19._Signficant_eQTLs_in_each_cell_type_and_time_point.xlsx
Description: Table S19. Significant eQTLs in each cell type and time point.
Variables
- Snps: the SNP used for eQTL analysis (Matrix-eQTL);
- Gene: the gene used for eQTL analysis;
- Statistic: sample correlation (|r|) as calculated by the simple linear model;
- Pvalue: raw p-value as calculated by the simple linear model;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg);
- Beta: coefficient calculated from the simple linear model;
- Se: standard error calculated from the simple linear model.
File: Table_S24._ASoC_SNPs_in_each_cell_type_and_time_point.xlsx
Description: Table S24. ASoC SNPs in each cell type and time point.
Variables
- CHROM: SNP chromosome;
- POS: SNP position;
- ID: SNP rsid;
- REF: reference allele;
- ALT: alternative allele;
- QUAL: quality score;
- NCALLED: number of total samples collated in analysis;
- REF_N: reference allele count;
- ALT_N: alternative allele count;
- DP: depth;
- pVal: nominal p-value of the binominal test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg).
File: Table_S6._Differentially_accessible_peaks_upon_KCI_stimulation_table.xlsx
Description: Table S6. Differentially accessible peaks upon KCI stimulation in each cell type.
Variables
- Chr: chromosome;
- Start: peak start position;
- End: peak end position;
- Strand: strandness information.
- Ave-log2FC: average log2 fold change of the corresponding peak between groups;
- P-value : P-value associated with the t-test used during analysis;
- Adj_p: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg).
File: Table_S14._Gene_regulatory_networks_identified_in_three_cell_types.xlsx
Description: Table S14. Gene regulatory networks identified in three cell types.
Variables
- TF: transcript factor;
- Target Gene: TF-targeting genes;
- Correlation: Spearman’s correlation coefficient.
File: Table_S20._Dynamic_cGenes_in_each_cell_type_and_time_point.xlsx
Description: Table S20. Dynamic eGenes in each cell type and time point.
Variables
- Snps: the SNP used for dynamic eQTL analysis (CellRegMap);
- Gene: the gene used for dynamic eQTL analysis (CellRegMap);
- p_inter: raw p-value from the interaction (GxC) test;
- Bonf: p-value adjusted at the gene level using the Bonferroni procedure to control the family‐wise error rate;
- FDR: the Bonf values further adjusted across genes, Storey’s method.
File: Table_S21._eQTL-based_cTWAS_results.xlsx
Description: Table S21. eQTL-based cTWAS results.
Variables
- PIP_sum: sum of posterior inclusion probability (PIP);
- PIP_0hr_GABA: PIP from 0hr in GABA, the other 8 columns are similar;
- GWAS_Pval_loci: the minimum P value in the loci from GWAS;
- eQTL_Pval: P value of the eQTL from the group with the highest PIP;
- Zgene_Pval: Imputed Z score of the gene with the highest PIP;
- Cell type: the cell type of the gene if over 60% of PIP comes from this cell type;
- PIP_diff : difference of PIP between the resting state (0hr) and the stimulated states (1/6 hour) across all cell types;
- Stimulated: a gene is dynamic if PIP_diff > 50%, otherwise static”;
- Rsid: rsid for the SNPs supporting the risk gene;
- Position: genomic position of the supporting SNP in hg38;
- ref_allele: reference allele;
- eff_allele: alternative allele;
- context: the context from which the SNP supports the eGene.
File: Table_S30._caQTL-based_cTWAS_results.xlsx
Description: Table S30. caQTL-based cTWAS results.
Variables
- condition: the condition with the highest PIP;
- snp: the SNP from the condition with the highest PIP;
- eqtl.gene: potential target genes from colocalized eQTL;
- range: location of the risk peak;
- p2g.gene: potential target gene derived from the correlation between peaks and genes from sc-multi omics profile.;
- overlap.gene: genes that overlap with the peak;
- tss.gene: potential target genes with transcription start site (TSS) closest to the peak;
- tss.dist: distance between the risk peak and the TSS;
- ABC.gene: potential target genes of the risk peaks according to our ABC scores;
- ABC.max: the target gene with the maximum ABC score;
- rsid: rsid for the SNPs supporting the risk gene;
- position: genomic position of the supporting SNP in hg38;
- ref_allele: reference allele;
- eff_allele: alternative allele;
- context: the condition from which the SNP supports the eGene.
File: All_other_supplementary_Tables_(1-3__5__7__8_10_11__15__16-18_22_26-29_31_32_33).xlsx
Description: All_other_supplementary_Tables_(1-3__5__7__8_10_11__15__16-18_22_26-29_31_32_33)
Table S1. Donor information of all the iPSC lines and their summary statistics of sn-multiomics data for all three time points of KCI stimulation.
Variables
- Cell line: Cell line ID representing anonymized individual donors;
- Aff: schizophrenia patients/healthy individuals. Case: SCZ patients; control: healthy individuals;
- Sex: individual sex at birth. M: male; F: female;
- Age: age at blood draw, numerical;
- Ancestry: ancestry information of individual donors;
- Co-culture batch: culture group ID used during neuron differentiation. 3-5 individuals per co-culture batch;
- Seq_batch: group ID describing the batch of Illumina NextSeq sequencing. Factorial;
- Time_point: property describing the stimulation status of each library. 0hr: unstimulated; 1hr: 1 hour post-stimulation; 6hr: 6 hours post-stimulation;
- GABA_count: the number of GABAergic neurons (barcodes) in this sample;
- Nmglut_count: the number of NEFM- glutamatergic neurons in this sample;
- Npglut_count: the number of NEFM+ glutamatergic neurons in this sample;
- intermediateGlut_count: the number of intermediate neurons in this sample;
- unidentified: the number of cells (barcodes) with uncertain identity;
- GABA_proportion: the fraction of GABAergic neurons in this sample;
- Nmglut_proportion: the fraction of NEFM- glutamatergic neurons in this sample;
- Npglut_proportion: the fraction of NEFM+ glutamatergic neurons in this sample;
- intermediateGlut_proportion: the fraction of intermediate neurons in this sample;
- unidentified_proportion: the fraction of cells (barcodes) with uncertain identity;
- total_umber: the gross number of valid cells (barcodes) identified in this sample;
- Used in SCZ analysis: whether this sample has been used during the differential expression analysis between SCZ/control groups.
**Table S2. Summary statistics of each sequencing library before and after QC. Each sequencing library is for cells of a co-cultured 2-4 iPSC lines at a particular time point of stimulation. **
Variables
- Sample ID: unique library IDs assigned to the corresponding sample (as separate 10x Genomics scGEX+ATAC-seq libraries), constructed as [Sequencing batch]_[sequencing group id]-[time];
- Estimated number of cells: QC-passed cell counts (as barcodes) of the corresponding library from the output of 10x Genomics Cell Ranger ARC;
- ATAC Sequenced read pairs: QC-passed scATAC-seq counts of the corresponding library;
- ATAC mean raw read pairs per cell: Mean of QC-passed scATAC-seq counts per cell of the corresponding library;
- ATAC Fraction of high-quality fragments overlapping TSS: the fraction of QC-passed reads overlapping transcription start sites (TSS) of the corresponding library;
- GEX Sequenced read pairs: QC-passed scRNA-seq (Gene expression, GEX) counts of the corresponding library;
- GEX mean raw read pairs per cell: The Mean values of QC-passed scRNA-seq (GEX) counts per cell of the corresponding library;
- GEX Median UMI counts per cell: The median values of QC-passed scRNA-seq (GEX) counts per cell of the corresponding library;
- GEX Reads mapped confidently to genome: gross number of GEX read pairs mapped confidently to human GRCh38 genome of the corresponding library;
- GEX Reads mapped confidently to transcriptome: gross number of GEX read pairs mapped confidently to human GRCh38 transcriptomeof the corresponding library;
- After_demux: cell counts after genotype-based de-multiplex. Representing cells (barcodes) confidently assigned to individuals;
- After_final_QC: Counts of cells (barcode) that are both confidently assigned to individuals and passed QC control. Representing the barcodes used in all subsequent analysis;
- Note: Note.
Table S3. SnRNA-seq post-QC summary data for all 100 lines.
Variables
- Line: cell line, same as in Table S1;
- Cell_counts: gross number of cells that have been assigned to this cell line;
- n_UMI: gross number of Unique Molecular Identifier (UMIs) that have been assigned to this cell line;
- UMI.per.cell: Mean values of UMIs per cell (barcode) of the corresponding cell line;
Table S5. NPD gene sets used for enrichment analyses.
Variables
- Prioritized PGC3 GWAS genes (n=65) (Nature 2022)
- SZ SCHEMA rare risk genes_Q meta <0.05 (Nature 2022)
- ASD risk genes (N=102) (Cell 2020);
- ASD genes (N=185) (Nature Genetics 2022);
- NDD genes (n=664) (Nature Genetics 2022);
- BP GWAS genes (Nature Genetics 2021);
- MDD GWAS genes (Nature Neuroscience 2019);
- PTSD GWAS genes (Nature Genetics 2021).
Table S7. Genes included in each gene module (cluster).
Variables
- Gene: gene symbol (GENCODE v35);
- Cluster: cluster number assigned to the corresponding gene.
Table S8. GO terms (biological process) enriched for each gene module.
Variables
- Term: GO term and ID;
- Overlap: Gene hits, sample/background;
- P.value: computed p-value of the Fisher’s exact test;
- Adjusted.P.value: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg);
- Odds.Ratio: The ratio of the proportion of genes in the query gene set annotated to the corresponding GO term divided by the proportion of genes in the background gene set of the corresponding GO term;
- Combined.Score: calculated as ln(p)*z. P represents the p-value computed using the Fisher exact test. Z represents the z-score computed by assessing the deviation from the expected rank.
- Genes: the list of all gene hits of the corresponding GO term according to the query gene set;
- Cluster: functional cluster assigned to the corresponding GO term.
Table S10. Micro-C summary statistics.
Variables
- Lib: Micro-c library used;
- Total: gross read count and (as the percentage of) total reads of each library;
- total_unmapped: gross unmapped read count and (as the percentage of) total reads;
- total_single_sided_mapped: gross count of read pairs that have only one read mapped to the genome and (as the percentage of) total reads;
- total_mapped: gross count of read pairs that mapped to the genome and (as the percentage of) total reads;
- total_dups: gross count of duplicated read pairs in the library and (as the percentage of) total reads;
- total_nodups: gross count of read pairs after de-duplication and (as the percentage of) total reads;
- cis: gross count of intra-chromosome interactions found and (as the percentage of) total reads;
- trans: gross count of inter-chromosome interactions found and (as the percentage of) total reads;
- cis_1kb+: gross count of intra-chromosome interactions found with separation of more than 1 kb distance and (as the percentage of) total reads;
- cis_2kb+: gross count of intra-chromosome interactions found with separation of more than 2 kb distance and (as the percentage of) total reads;
- cis_4kb+: gross count of intra-chromosome interactions found with separation of more than 4 kb distance and (as the percentage of) total reads;
- cis_10kb+: gross count of intra-chromosome interactions found with separation of more than 10 kb distance and (as the percentage of) total reads;
- cis_20kb+: gross count of intra-chromosome interactions found with separation of more than 20 kb distance and (as the percentage of) total reads;
- cis_40kb+: gross count of intra-chromosome interactions found with separation of more than 40 kb distance and (as the percentage of) total reads;
Table S11. Micro-C chromatin contacts in each sample. Listed are 5 kb bins.
Variables
- bait_chr: chromosome of the bait sequence;
- bait_start: start position of the bait sequence;
- bait_end: end position of the bait sequence;
- bait_name: name of the bait sequence;
- otherEnd_chr: chromosome of the other end of the fragment;
- otherEnd_start: start position of the other end of the fragment;
- otherEnd_end: end position of the other end of the fragment;
- otherEnd_name: name of the fragment, if any;
- N_reads: gross count of the fragments found;
- Score: the final CHiCAGO score of the corresponding fragment.
Table S15. Target genes regulated by the ASD risk TFs in each cell type.
Variables
- TF: transcript factor;
- Cell type: cell type in which the target genes were identified;
- Number of target genes: the number of genes targeted by the corresponding TF;
- Target genes: the full list of target genes.
Table S16. GO-term (Biological Process) enrichment for four ASD risk-associated TFs. The FDR was recalculated to account for multiple testing across the four TFs.
Variables
TF: transcription factor;
- Term: GO term and ID;
- Overlap: Gene hits, sample/background;
- P.value: computed p-value of the Fisher’s exact test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg);
- Odds.Ratio: The ratio of the proportion of genes in the query gene set annotated to the corresponding GO term divided by the proportion of genes in the background gene set of the corresponding GO term;
- Combined.Score: calculated as ln(p)*z. P represents the p-value computed using the Fisher exact test. Z represents the z-score computed by assessing the deviation from the expected rank.
- Genes: the list of all gene hits of the corresponding GO term according to the query gene set.
Table S17. GO-term (Molecular Function) enrichment of target genes shared by at least three of the ASD risk TFs. Only "molecular function" terms are enriched.
Variables
TF: transcription factor;
- Term: GO term and ID;
- Overlap: Gene hits, sample/background;
- P.value: computed p-value of the Fisher’s exact test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg);
- Odds.Ratio: The ratio of the proportion of genes in the query gene set annotated to the corresponding GO term divided by the proportion of genes in the background gene set of the corresponding GO term;
- Combined.Score: calculated as ln(p)*z. P represents the p-value computed using the Fisher exact test. Z represents the z-score computed by assessing the deviation from the expected rank.
- Genes: the list of all gene hits of the corresponding GO term according to the query gene set.
Table S18. TFs with target genes enriched for ASD genes in each cell type.
Variables
- TF: transcription factor;
- Overlap: Gene hits, sample/background;
- Odds.Ratio: The ratio of the proportion of genes in the query gene set annotated to the corresponding GO term divided by the proportion of genes in the background gene set of the corresponding GO term;
- P-value: computed p-value of the Fisher’s exact test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg).
Table S22. GO term enrichment of cTWAS NPD genes.
Variables
- Term: GO term and ID;
- Overlap: Gene hits, sample/background;
- P.value: computed p-value of the Fisher’s exact test;
- Adjusted.P.value: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg);
- Odds.Ratio: The ratio of the proportion of genes in the query gene set annotated to the corresponding GO term divided by the proportion of genes in the background gene set of the corresponding GO term;
- Genes: the list of all gene hits of the corresponding GO term according to the query gene set.
Table S26. Percentage of ASoC SNPs that can be assigned to a Micro-C based cis-target gene in a matching context.
Variables
- Type: cell type;
- ASoC: ASoC set found in the stimulation state of the corresponding cell types;
- Micro-C: micro-c interaction found in stimulation time of the corresponding cell types (categorised as 0 hr and 1/6 hr and represented unstimulated and stimulated states);
- Interacting bins/with targets: the count of interaction pairs of the corresponding cell type and stimulation stage combinations;
- % of ASoC: interacting ASoCs as the percentage of total ASoCs at the specific cell type and stimulation stage combination.
Table S27. ASoC status of Schizophrenia (SCZ) GWAS risk SNPs and their LD proxies (R2>0.8) in each cell type and time point. Brain eQTL and Micro-C target annotations are from column BN to BT.
Variables
- GWAS_index: rsid of the proxy SNP;
- ASOC_snp: SNP rsid;
- NCALLED: number of total samples collated in analysis;
- REF_N: reference allele count;
- ALT_N: alternative allele count;
- DP: depth;
- pVal: nominal p-value of the binominal test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg).
- Brain_eQTL: rsid of identified brain eQTL SNPs;
- CommonMind/GTEx eQTL: rsid of identified CommonMind/GTEx eQTL SNPs;
- GTEx eQTL: rsid of identified GTEx eQTL SNPs;
- iPSC neuron Eqtl: rsid of identified iPSC-derived neuron eQTL SNPs;
- Micro_C: target gene from Micro-C analysis;
- Time: stimulation stages where the corresponding SNP was found significant;
- Target: potential targeted genes, if any.
Table S28. ASoC status of Bipolar disorder (BP) GWAS risk SNPs and their LD proxies (R2>0.8) in each cell type and time point. Brain eQTL and Micro-C target annotations are from column BN to BT.
Variables
- GWAS_index: rsid of the proxy SNP;
- ASOC_snp: SNP rsid;
- NCALLED: number of total samples collated in analysis;
- REF_N: reference allele count;
- ALT_N: alternative allele count;
- DP: depth;
- pVal: nominal p-value of the binominal test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg).
- Brain_eQTL: rsid of identified brain eQTL SNPs;
- CommonMind/GTEx eQTL: rsid of identified CommonMind/GTEx eQTL SNPs;
- GTEx eQTL: rsid of identified GTEx eQTL SNPs;
- iPSC neuron Eqtl: rsid of identified iPSC-derived neuron eQTL SNPs;
- Micro_C: target gene from Micro-C analysis;
- Time: stimulation stages where the corresponding SNP was found significant;
- Target: potential targeted genes, if any.
Table S29. ASoC status of Major depression disorder (MDD) GWAS risk SNPs and their LD proxies (R2>0.8) in each cell type and time point. Brain eQTL and Micro-C target annotations are from column BN to BT.
Variables
- GWAS_index: rsid of the proxy SNP;
- ASOC_snp: SNP rsid;
- NCALLED: number of total samples collated in analysis;
- REF_N: reference allele count;
- ALT_N: alternative allele count;
- DP: depth;
- pVal: nominal p-value of the binominal test;
- FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg).
- Brain_eQTL: rsid of identified brain eQTL SNPs;
- CommonMind/GTEx eQTL: rsid of identified CommonMind/GTEx eQTL SNPs;
- GTEx eQTL: rsid of identified GTEx eQTL SNPs;
- iPSC neuron Eqtl: rsid of identified iPSC-derived neuron eQTL SNPs;
- Micro_C: target gene from Micro-C analysis;
- Time: stimulation stages where the corresponding SNP was found significant;
- Target: potential targeted genes, if any.
Table S31. Result of single cell DE analysis in neurons of each context (Cell type x time point after stimulation) between 28 SCZ cases and matched controls. BH_FDR values were derived from the MAST p-value for each context.
Variables
- geneIDs: gene symbols (GENCODE v35);
- log2FC: log2 fold change of the corresponding gene between groups;
- pvalue: p-values associated with the t-statistic of the corresponding gene;
- padj: Adjusted p-values, corrected for multiple testing (Bonferroni method);
- pct.case: percentage of cells expressing the corresponding gene in SCZ case group;
- pct.ctrl: percentage of cells expressing the corresponding gene in SCZ control group;
- BH_FDR: Adjusted p-values, corrected for multiple testing (Benjamini-Hochberg method);
- Category: the cell types and stimulation stages where the SCZ DEG analysis is performed;
- C6_Cholesterol genes: is the corresponding gene found in the C6 cholesterol gene list;
- PGC3_SCZ genes: is the corresponding gene found in the PGC3 SCZ gene list;
- caQTL_cTWAS gene: is the corresponding gene found as a member of the caQTL-cTWAS list;
- SZ SCHEMA rare risk genes: is the corresponding gene found in the SCZ SCHEMA rare risk genes list;
- ASD_102 genes: is the corresponding gene found in the autism spectrum disorder (ASD) risk gene list.
Table S32. sgRNA sequences and PCR primer sequences as well as qPCR assays.
Variables
- sgRNA sequences, PCR primer sequences and Assay IDs of ThermoFisher qPCR assays.
Table S33. GWAS datasets used in enrichment tests (MAGMA, TORUS, sLDSC and cTWAS).
Variables
- Trait: GWAS disease trait;
- Id: database name in abstract form;
- Link: original publication;
- download link: download link of the corresponding GWAS datasets.
Access information
Access information
Data was derived from the following sources:
- NCBI SRA PRJNA1194194
- NCBI GEO GSE286488
- Zenodo repository https://zenodo.org/records/14577991
