Data from: Cross-species variant-to-function analyses implicate MEIS1 in conferring sleep abnormalities and impaired cerebellar development
Data files
May 27, 2026 version files 3.95 MB
-
18.tcf12R1_B03_029.ab1
219.99 KB
-
18.tcf12R1_B03_029.seq
353 B
-
20.NegtcfR1_D03_025.ab1
221.23 KB
-
20.NegtcfR1_D03_025.seq
365 B
-
21.arfgap2F1_E03_023.ab1
220.58 KB
-
21.arfgap2F1_E03_023.seq
352 B
-
23.NegarfF1_G03_019.ab1
221.52 KB
-
23.NegarfF1_G03_019.seq
361 B
-
41.cbx1aF1_A06_048.ab1
210.96 KB
-
41.cbx1aF1_A06_048.seq
266 B
-
45.negcbx1aF1_E06_040.ab1
208.70 KB
-
45.negcbx1aF1_E06_040.seq
241 B
-
47.gnb3bF1_G06_036.ab1
209.35 KB
-
47.gnb3bF1_G06_036.seq
249 B
-
51.neggnb3bF1_C07_059.ab1
209.28 KB
-
51.neggnb3bF1_C07_059.seq
251 B
-
52.skiv2lR2_D07_057.ab1
209.75 KB
-
52.skiv2lR2_D07_057.seq
268 B
-
56.NegSkivR1_H07_049.ab1
209.93 KB
-
56.NegSkivR1_H07_049.seq
263 B
-
65.meis1bF1_A09_079.ab1
205.94 KB
-
65.meis1bF1_A09_079.seq
239 B
-
67.Negmeis1bF1_C09_075.ab1
205.06 KB
-
67.Negmeis1bF1_C09_075.seq
236 B
-
70.cbx1bR1_F09_069.ab1
207.32 KB
-
70.cbx1bR1_F09_069.seq
249 B
-
72.Negcbx1bR1_H09_065.ab1
307.61 KB
-
72.Negcbx1bR1_H09_065.seq
941 B
-
76.gnb3aR2_D10_074.ab1
209.27 KB
-
76.gnb3aR2_D10_074.seq
262 B
-
86.meis1a2R2_F11_085.ab1
222.06 KB
-
86.meis1a2R2_F11_085.seq
367 B
-
88.negmeis1a2R1_H11_081.ab1
218.80 KB
-
88.negmeis1a2R1_H11_081.seq
334 B
-
CRISPR_Allgenes_OutliersMarked.xlsx
181.46 KB
-
README.md
4.24 KB
-
ZebrafishSleepData_Zimmerman2025_DRYAD.xlsx
36.57 KB
Abstract
Automated video tracking of larval zebrafish provides a method for high-throughput analysis of sleep-wake behaviors. This method, paired with in vivo gene editing, facilitates screening genes associated with sleep and wake, providing an avenue for the identification of novel behavioral modifiers. Here, we provide the data from a larval zebrafish behavioral screen of candidate genes associated with human insomnia. Behavioral data were collected using automated video monitoring with the Zebrabox system from ViewPoint Life Sciences to analyze sleep and activity metrics in F0 crispants for insomnia-associated genes identified from human GWAS. Statistical analyses and individual sleep and activity parameters are provided from all experiments. Sanger sequencing files are also provided, confirming successful mutation of the target gene.
Description of the data and file structure
Data are available for reuse. The authors have no conflicts to disclose.
Zebrafish sleep analysis output metrics.
Data were collected using ViewPoint Software and first analyzed in MATLAB. Statistical comparisons were performed in SAS.
Eleven sleep and activity parameters were analyzed for each crispant (gene manipulation group) vs. negative control-injected siblings.
All behavioral measures are provided in the file "CRISPR_Allgenes_OutliersMarked." This file contains the averages across the two days in which sleep is captured (days 6 and 7 post-fertilization). All fish from each clutch are included, and fish that were considered extreme outliers by visual inspection are identified. These were likely representative of fish that died or became unhealthy during the assay,y or had debris or a bubble obscuring the camera.
All statistical comparisons were performed within each gene-control pair. No comparisons were made between different genes.
Primary comparisons between crispant and negative control-injected siblings were performed using both parametric t-test and non-parametric Wilcoxon rank-sum tests to mitigate any potential impact of non-normality of the endpoints. A Hochberg step-up procedure[1],[2] was applied to the analysis of each gene independently to maintain gene-specific type I error at the desired level of 0.05 across the 11 tested hypotheses. Notably, this multiple testing correction procedure defines a threshold for significance based on p-value ranks. The behavioral measures deemed significant by this threshold are bolded in the file "ZebrafishSleepData_Zimmerman2025_DRYAD."
[1] Hochberg Y. A sharper Bonferroni procedure for multiple tests of significance. *Biometrika. *1988:75(4):800-802.
[2] Huang Y and Hsu J.C. Hochberg’s step-up method: cutting corners off Holm’s step-down method. Biometrika. 2007:94(4):965-975
Output:
- n_Control/ n_[Gene name] = sample size
- musd = mean (standard deviation)
- muci = mean (95% confidence interval of the mean)
- medrng = median (range)
- diffci95 = mean difference (95% confidence interval)
- smd = standardized mean difference (effect size)
- pc_ttest = p-value corrected t-test (assumes normality)
- hochsig_ttest = Did the corrected p-value reach the significance threshold for the Hochberg step-up procedure following multiple t-test correction?
- pc_rnksum = p-value corrected following rank sum test (does not assume normality)
- hochsig_rnksum = Did corrected p-value reach significance threshold for Hochberg step-up procedure following rank sum test [this option was used to determine significance, as many activity traits are non-normally distributed]
- Pooled SD = pooled standard deviation - used to calculate SMD
- SE of SMD = standard error of SMD - used to calculate confidence intervals of SMD
- LI 95 CI = lower interval for the 95% Confidence Interval of the SMD
- UI 95 CI = upper interval for the 95% Confidence Interval of the SMD
All behavioral traces for days 6 and 7 post fertilization,n as well as individual data points, are plotted in the file "Zimmermanetal-SupplementaryBehaviorData."
Data are shown for each clutch separately.
Sanger Sequencing files are provided for each gene target as a sequencing file (.seq) and a Sanger trace (.ab1). These files were used to generate Supplementary Figure 2. Each file represents the guideRNA target amplicon generated using target-specific primers for a single crispant fish, showing the on-target mutation generated by the guideRNA. File names indicate gene name [meis1a, meis1b, cbx1a, cbx1b, gnb3a, gnb3b, skiv2l, tcf12, arfgap2] or negative control sibling [negmeis1b = negative control sibling of meis1b crispant]. Forward or Reverse strand is indicated by (F) or (R). The remaining letters are lane/sample identifiers. (e.g. meis1bF1_A09_079 = meis1b crispant, Forward strand). Corresponding .seq and .ab1 files are labeled with the same name. Sample QC was provided by the University of Pennsylvania Sanger Sequencing Core. No post-processing was applied.
All embryos and larvae were housed in an incubator at 28.5°C, with lights on at 9am. (ZT0) and lights off at 11pm (ZT14) prior to data collection. Dead embryos and chorion membranes were removed daily until day 5 post fertilization. On day 5, CRISPR mutants and scramble-injected sibling controls were screened for gross morphological deficits and healthy larvae were pipetted into individual wells of a 96-well plate and placed into a Zebrabox (ViewPoint Life Sciences) for automated video monitoring. Genotypes were placed into alternating rows to minimize location bias within the plate. All animals were allowed to acclimate to the Zebrabox for approximately 24 hours before beginning continuous data collection for 48 hours starting at lights-on (9am). Each Zebrabox is sound-attenuating and contains circulating water held at a temperature of 28.5ºC with automated lights cycling on the same 14-hour:10-hour light/dark cycle. Sleep-wake behaviors were measured through automated video-tracking, as described previously (Kroll et al., 2021; Palermo et al., 2023; Rihel et al., 2010). At least two biological replicates were run for each gene using different clutches (sibling-matched within clutch) of embryos and well placement was flipped for each experiment to minimize location bias across experiments. Activity data were captured using automated video tracking (Viewpoint Life Sciences) software in quantization mode (Palermo et al., 2023). As described previously (Chen et al., 2017), threshold for detection was set as the following: detection threshold: 20; burst: 29; freeze: 3; bin size: 60 seconds. Data were processed using custom MATLAB scripts (Lee et al., 2022) to calculate the following parameters for both day and night separately: sleep duration (minutes/hour), activity duration (seconds/hour), waking activity (seconds/awake minute/hour), sleep bout length (minutes/bout), sleep bout number (number/hour) and nighttime sleep latency (minutes).
Statistical analysis and control for multiple comparisons across sleep traits
Sample size determination was based on previous work by Kroll et al, indicating 48 larvae per group (half of one 96-well plate) is sufficient to detect mutant phenotypes at a power of 0.8 and significance level of 0.05. Experiments were repeated in the same Zebrabox (ViewPoint Life Sciences) at least twice. Effect sizes are described as standardized mean difference (SMD) and 95% confidence intervals (CI). Phenotypes of interest included a total of eleven measurements related to sleep and activity, including day and night sleep duration, activity, waking activity, sleep bout length, and sleep bout number, as well as nocturnal sleep latency. Primary comparisons between crispant and negative control-injected siblings were performed using a Hochberg step-up procedure that was applied independently to analysis of each gene to maintain gene-specific type I error at the desired level of 0.05 across the tested hypotheses.
