Morphodynamic atlas for Drosophila development
Data files
Dec 13, 2022 version files 181.46 GB
-
67-15_1to3copies_UAS-Even-Skipped.tar.lz4
610.81 MB
-
67-singlecopy_15-singlecopy_UAS-18W-HA.tar.lz4
87.07 MB
-
67-singlecopy_15-singlecopy_UAS-Even-Skipped.tar.lz4
973.31 MB
-
67-singlecopy_15-singlecopy_UAS-Runt.tar.lz4
1.52 GB
-
67-singlecopy_15-singlecopy_UAS-Tollo-HA.tar.lz4
161.77 MB
-
67-singlecopy_UAS-Even-Skipped.tar.lz4
113.88 MB
-
67-singlecopy_UAS-Runt.tar.lz4
737.21 MB
-
bicoid_E1__nanos_EN__tsl_4_.tar.lz4
2.73 GB
-
bicoid_E1__nanosBN_.tar.lz4
2.24 GB
-
concertina-t48.tar.lz4
179.10 MB
-
embryo_geometry.tar.lz4
482.41 MB
-
even-skipped_r13_.tar.lz4
14.71 GB
-
Halo_Hetero_Even-Skipped_r13__Hetero.tar.lz4
283.24 MB
-
Halo_Hetero_Snail_IIG05__Hetero.tar.lz4
2.76 GB
-
Halo_Hetero_Twist_ey53__Hetero.tar.lz4
6.26 GB
-
Halo_snail_IIG05_.tar.lz4
7.06 GB
-
Halo_twist_ey53_.tar.lz4
3.14 GB
-
opto-RhoGEF2.tar.lz4
5.12 GB
-
README.md
9.89 KB
-
spaetzle_A_.tar.lz4
12.45 GB
-
sqh_Ax3_.tar.lz4
6.59 GB
-
timing.tar.lz4
125.49 MB
-
toll_RM9_.tar.lz4
21.55 GB
-
TrafficJam-Gal4_UAS-Fat2RNAi.tar.lz4
5.36 GB
-
UAS-Even-Skipped.tar.lz4
388.55 MB
-
WT__INTERNAL__concatenated_fixed_data.tar.lz4
1.85 GB
-
WT_Bazooka-GFP.tar.lz4
420.54 MB
-
WT_Bazooka.tar.lz4
47.19 MB
-
WT_CAAX-mCherry.tar.lz4
6.19 GB
-
WT_ECad-GFP.tar.lz4
4.55 GB
-
WT_endogenous_ECad-GFP.tar.lz4
5.63 GB
-
WT_Even_Skipped-YFP.tar.lz4
3.65 GB
-
WT_Even_Skipped.tar.lz4
1.21 GB
-
WT_Fushi_Tarazu.tar.lz4
470.98 MB
-
WT_H2a-mCherry_Klarsicht.tar.lz4
6.97 GB
-
WT_Hairy.tar.lz4
238.80 MB
-
WT_histone-RFP.tar.lz4
1.92 GB
-
WT_Moesin-GFP.tar.lz4
6.04 GB
-
WT_Neurotactin.tar.lz4
1.52 GB
-
WT_Paired.tar.lz4
431.51 MB
-
WT_Runt_examples.tar.lz4
166.75 MB
-
WT_Runt.tar.lz4
6.78 GB
-
WT_Shotgun-GFP.tar.lz4
4.69 GB
-
WT_Sloppy_Paired.tar.lz4
444.69 MB
-
WT_Sqh_greater_RokK116A-GFP.tar.lz4
724.51 MB
-
WT_Sqh-GFP.tar.lz4
36.72 MB
-
WT_Sqh-mCherry__.tar.lz4
21.03 GB
-
WT_Sqh-mCherry_17_Degrees.tar.lz4
2.54 GB
-
WT_Sqh-mCherry_27_Degrees.tar.lz4
4.16 GB
-
WT_Tartan.tar.lz4
529.24 MB
-
WT_Toll_6.tar.lz4
224.96 MB
-
WT_Toll_8.tar.lz4
466 MB
-
WT_ubiquitous_Rock-GFP.tar.lz4
517.26 MB
-
WT_utr-mCherry.tar.lz4
2.39 GB
Dec 13, 2024 version files 170.81 GB
-
67-15_1to3copies_UAS-Even-Skipped.tar.lz4
610.86 MB
-
67-singlecopy_15-singlecopy_UAS-18W-HA.tar.lz4
182.62 MB
-
67-singlecopy_15-singlecopy_UAS-Even-Skipped.tar.lz4
1.04 GB
-
67-singlecopy_15-singlecopy_UAS-Runt.tar.lz4
1.52 GB
-
67-singlecopy_15-singlecopy_UAS-Tollo-HA.tar.lz4
161.78 MB
-
67-singlecopy_UAS-Even-Skipped.tar.lz4
113.97 MB
-
67-singlecopy_UAS-Runt.tar.lz4
737.74 MB
-
bicoid_E1__nanos_BN_.tar.lz4
2.24 GB
-
bicoid_E1_nanos_BN_tsl_4.tar.lz4
2.73 GB
-
bicoid_E1_nanos_BN.tar.lz4
2.24 GB
-
concertina-t48.tar.lz4
179.26 MB
-
embryo_geometry.tar.lz4
481.87 MB
-
even-skipped_r13.tar.lz4
14.89 GB
-
Halo_Hetero_even-skipped_r13_Hetero.tar.lz4
283.56 MB
-
Halo_Hetero_snail_IIG05_Hetero.tar.lz4
2.76 GB
-
Halo_Hetero_twist_ey53_Hetero.tar.lz4
6.38 GB
-
Halo_snail_IIG05.tar.lz4
7.17 GB
-
Halo_twist_ey53.tar.lz4
3.26 GB
-
INTERNAL_PIVlab_results.tar.lz4
1.13 GB
-
INTERNAL_timing.tar.lz4
125.53 MB
-
INTERNAL_WT_concatenated_fixed_data.tar.lz4
1.85 GB
-
Morphodynamic_Atlas.csv
121.81 KB
-
README.md
12.45 KB
-
spaetzle_A.tar.lz4
12.60 GB
-
sqh_Ax3.tar.lz4
6.59 GB
-
toll_RM9.tar.lz4
21.65 GB
-
TrafficJam-Gal4_UAS-Fat2RNAi.tar.lz4
5.36 GB
-
UAS-Even-Skipped.tar.lz4
388.72 MB
-
WT_17_Degrees.tar.lz4
2.69 GB
-
WT_27_Degrees.tar.lz4
4.26 GB
-
WT_Bazooka-GFP.tar.lz4
420.87 MB
-
WT_Bazooka.tar.lz4
47.23 MB
-
WT_CAAX-mCherry.tar.lz4
6.19 GB
-
WT_ECad-GFP.tar.lz4
4.55 GB
-
WT_endogenous_ECad-GFP.tar.lz4
5.64 GB
-
WT_Even_Skipped-YFP.tar.lz4
3.65 GB
-
WT_Even_Skipped.tar.lz4
964.39 MB
-
WT_Fushi_Tarazu.tar.lz4
134.36 MB
-
WT_H2a-mCherry_Klarsicht.tar.lz4
6.97 GB
-
WT_Hairy.tar.lz4
57.17 MB
-
WT_histone-RFP.tar.lz4
1.92 GB
-
WT_Moesin-GFP.tar.lz4
6.04 GB
-
WT_Neurotactin.tar.lz4
342.70 MB
-
WT_Paired.tar.lz4
82.26 MB
-
WT_Runt.tar.lz4
4.18 GB
-
WT_Shotgun-GFP.tar.lz4
4.69 GB
-
WT_Sloppy_Paired.tar.lz4
107.64 MB
-
WT_Sqh-GFP.tar.lz4
36.74 MB
-
WT_Sqh-mCherry.tar.lz4
17.13 GB
-
WT_Sqh-RokK116A-GFP.tar.lz4
725.04 MB
-
WT_Tartan.tar.lz4
121.95 MB
-
WT_Toll_6.tar.lz4
91.94 MB
-
WT_Toll_8.tar.lz4
165.75 MB
-
WT_ubiquitous-Rock-GFP.tar.lz4
517.66 MB
-
WT_utr-mCherry.tar.lz4
2.39 GB
Abstract
This data repository contains the Drosophila morphodynamic atlas, a collection of 478 in-toto lightsheet-microscopy recordings of the Drosophila embryo at the blastoderm stage (stages 6-9). The repository comprises 18 different genotypes. For details on the data and its formating, please see "README.md". A table of all recordings can be found in "MorphodynamicAtlas.csv". A tutorial for querying and loading data from the atlas using Python can be found in "Morphodynamic_Atlas_Tutorial.ipynb". For details of how the atlas was created and how it can be used, please see the publication.
Publication abstract: During morphogenesis, diverse cell-scale and tissue-scale processes couple to dynamically sculpt organs. In this coupling, genetic expression patterns and biochemical signals regulate and respond to mechanical deformations to ensure reproducible and robust changes in tissue geometry. A long-standing approach to characterize these interactions has been the construction of expression atlases, and these atlases have necessarily relied on fixed snapshots of embryogenesis. Addressing how expression profiles relate to tissue dynamics, however, requires a scheme for spatiotemporal registration across different classes of data that incorporates both live samples and fixed datasets. Here, we construct a morphodynamic atlas that unifies fixed and live datasets – from gene expression profiles to cytoskeletal components – into a single, morphological consensus timeline. This resource and our computational approach to global alignment facilitate hypothesis testing using quantitative comparison of data both within and across ensembles, with resolution in both space and time to relate genes to tissue rearrangement, cell behaviors, and out-of-plane motion. Examination of embryo kinematics reveals stages in which tissue flow patterns are quasi-stationary, arranged as a sequence of ‘morphodynamic modules’. Temperature perturbations tune the duration of one such module – during body axis elongation – according to a simple, parameter-free scaling in which the total integrated tissue deformation is achieved at a temperature-dependent rate. By extending our approach to visceral organ formation during later stages of embryogenesis, we highlight how morphodynamic atlases can incorporate complex shapes deforming in 3D. In this context, morphodynamic modules are reflected in some, but not all, measures of tissue motion. Our approach and the resulting atlas open up the ability to quantitatively test hypotheses with resolution in both space and time, relating genes to tissue rearrangement, cell behaviors, and organ motion.
README: Morphodynamic atlas of Drosophila development
Dynamic altas of early Drosophila development created using in-toto lightsheet microscopy and tissue cartography.
Reference Information
Dataset contributors
Mitchell, Noah, University of California, Santa Barbara
Lefebvre, Matthew, University of California, Santa Barbara
Jain-Sharma, Vishank, University of California, Santa Barbara
Claussen, Nikolas, University of California, Santa Barbara
Raich, Marion, Technical University Munich
Gustafson, Hannah, University of California, Santa Barbara
Bausch, Andreas, Technical University Munich
Streichan, Sebastian, University of California, Santa Barbara
Related manuscript
The dataset presented here is described in the manuscript "Morphodynamic Atlas for Drosophila Development" by Mitchell et al., https://www.biorxiv.org/content/10.1101/2022.05.26.493584v1. The reader can find details about the motivation, experimental and computer vision methods that underly the dataset.
Funding
This work was supported by:
- National Institutes of Health, Award: R35 GM138203
- National Science Foundation, Award: PHY- 1748958
- National Science Foundation, Award: PHY-1707973
License
This work is licensed under a CC0 1.0 Universal (CC0 1.0) Public Domain Dedication license.
About this README
Author: Nikolas Claussen, nclaussen@ucsb.edu. Dated 11/28/2022.
Description of the data and folder structure
Overview
The data, showing recordings of the surfaces of mostly early Drosophila embryos (stages 5-9), is organized in folders first by genotype of the fly line imaged (e.g. 'WT' for wild type), and then by the fluorophore visualized (e.g. Histone-RFP). Each folder in a genotype/fluorophore combination is indexed by the date in YYYY-MM-DD-HH-MM format, and contains data pertaining to a single experiment. For example, the folder 'toll[RM9]/Bazooka/201908151459' contains data about an embryo from the fly line toll[RM9], stained with an antibody for the protein Bazooka, and imaged on 15/08/2019.
This dataset also contains a set of live recordings of WT embryos at temperatures much higher or lower than standard room temperature conditions (17 and 27 degrees Celsius, respectively). For the purpose of organizing this dataset, these experiments are considered their own "genotype", "WT_17_Degrees" and "WT_27_Degrees".
A list of all datasets is available as a the .csv file 'Morphodynamic_Atlas.csv'. Each row corresponds to a dataset, and the different columns contain the relevant information.
The atlas can be queried (e.g. to find all recordings with a given fluorophore) either using a spread sheet program like Excel, or in Python. A short Python tutorial is provided in 'Morphodynamic_Atlas_Tutorial.ipynb'. The tutorial is also available as a .pdf so you can view it without having Python installed.
The meaning of the columns is explained in what follows, and in the first row of the .csv file:
'Embryo_ID': ID of embryo (time of recording), in YYYYMMDDHHMM format
'Genotype': Genotype. For organization purposes, recordings at different temperatures are considered genotypes.
'Fluorofore': Fluorescently tagged protein. For live recordings, we distinguish different tags, e.g. Sqh-GFP and Sqh-mCherry
'Is_Live': Live or fixed recording
'Has_PIV': Whether PIV (tissue flow fields) are available. Saved in a PIV/ directory in the same folder as the underlying images
'Has_PIVLab': Whether high-resolution PIV data (created by PIV lab is available. Saved as XXX_PIVlab.mat file in the same folder as the underlying images
'Timing_PIV': Timing based on PIV. Indicates frame of movie where PIV switches from ventral furrow to GBE pattern, based on autocorrelation matrix.
'Timing_Runt': Timing based on Runt stripe 7 deformation. Indicates the corresponding time in Runt reference embryo 202001150004
'Timing_Warped': Timing based on Runt 7 stripe deformation, after construction of consensus timeline via warping
'Timing_PMG_VF': Timing based ventral or cephalic furrow for alignment across mutant genotypes
'Filename': Name of .tif file with image data. Note: some embryos may have multiple .tif files available, e.g. max- and mean projections
'Time_Resolution_Seconds': Time resolution in seconds
'Notes': Any additional notes about the dataset
The folders containing the data were zipped into .tar.lz4 archives, with one zipped file for each genotype (e.g. 'toll[RM9]'), with the exception of WT, were, due to the large size of the dataset, each fluorophore has its own zipped file (e.g. 'WT_Sqh-GFP'). The folders were saved as a .tar.lz4 archives, and decompression can be done in a UNIX shell via:
~$ lz4 -d folder_name.tar.lz4 -c | tar xvf -
Tissue cartography conventions
The datasets show the surface of the Drosophila embryo. The Drosophila egg is a curved, ellipsoidal surface, which in this dataset is mapped to the plane by a cylindrical chart in which anterior is left, posterior right, and the ventral side corresponds to the top and bottom edge of the image. See also below, "Timing and cartography metadata".
Multichannel data
Importantly, many recordings in the morphodynamic atlas are multichannel and show two or more different fluorophores. For example, for the WT embryo identifies by timestamp "202001072039", we have a recording of both the proteins "Hairy" and "Runt". The data from the two channels can be found in the respective subfolders /WT/Hairy/202001072039 and /WT/Runt/202001072039. To see whether an embryo of interest was also imaged for other channels, use file search by the embryo time stamp.
Data available for each embryo recording
As explained above, each folder of the form /[genotype]/[fluorophore]/[timestamp] contains data about a single recording.
We now explain what type of data files are available.
Importantly, the dynamic atlas contains both live datasets (i.e. movies of embryos developing inside the microscope), and fixed datasets (embryos killed, fixed, and stained for a particular protein at one moment in development). We first explain what data is available for live recordings:
- .tif files. These are the actual images / movies. Movies are saved as .tif stacks containing multiple frames. Often, more than one .tif file is available. The reason for this is that the movies are z-projections of 3d datasets (as explained in the manuscript linked below), and often, different projections are included. The type of projection is indicated by the file name: "MEAN/mean" indicates a mean of z-slices, "MAX/max" a maximum projection, and "basal" a projection focusing on the basal layer of the cell. Note that for historical reasons, the naming convention for the main .tif files can vary between embryos and genotypes, but the files always show the same information.
- PIV vector fields. For most live datasets, we include a quantification of tissue flow via Particle Image velocimetry. The resulting flow vector fields are saved as .mat file for each timepoint in a separate sub folder, e.g. "PIV". Units of the PIV fields are pixels/frame. Sometimes, more than one version of PIV is included, when different filters where applied (e.g. "PIV_filtered" stores post-processed PIV flow fields in which the velocity field was smoothed), or when a different program was used to compute the PIV (e.g. "PIVlab", using the PIVLab MATLAB plugin). Details on the PIV methodology can be found in the manuscript.
- timing data. For two genotypes, "WT" and "Halo-Hetero Twist[ey53]-Hetero", we used the methodology described in the manuscript to generate timestamps, i.e. for each movie, the frame which corresponds to a common timepoint along the fly embryo morphogenetic trajectory. This is contained in a file called "t0V.txt".
- unit files. All movies in the atlas where taken at a spatial resolution of 0.2619 microns/pixel (though see section below on cartography metadata), and at a time resolution of 1 min?frame, except otherwise noted, but some folders contain files "dt.txt" and "pix2mum.txt", describing the tmeporal resolution in minutes, and the spatial resolution in microns/pixel.
- "info" files. These files are located at the level of the parent folder (i.e. /[genotype]/[fluorofore]) and contain relevant information that applies to the entire set of recordings contained in the folder.
- other files, e.g. "pullbackPathlines", that are of internal nature and where used, for example, to computionally generate the common morphological timeline explained in the manuscript.
The following type of datafiles is available for fixed recordings.
- .tif files. As for live data, these are the actual images, potentially available in different z-projected versions ("MEAN" and "MAX").
- "info" files. These files are located at the level of the parent folder (i.e. /[genotype]/[fluorofore]) and contain relevant information that applies to the entire set of recordings contained in the folder.
For data from the wild type phenotype (WT), additional data is available:
- timing data. WT fixed embryos were time-aligned using the method described in the manuscript. Briefly, all embryos were co-stained with a second marker, Runt, for which live recordings are available. Comparison of a morphological landmark (the shape of Runt stripe 7) to the live data enabled time stamping. For time stamped data, there is a file "timematch_Runt_Runtstripe7_chisq" containing both the time stamp (in minutes, with respect to the live master timeline), and the uncertainty of the timing (in minutes). This file is available as .txt and as .mat. The directory containing the Runt channel of such a fixed embryo contains additional files used by the program that generates the time stamps, notably plots ("stripe7_ssr[...].png", "Runtstripe7_rgb_[...].png") and data on the location and shape of the morphological landmark ("Runt_stripe7curve.mat", "Runt_mask.tif").
- smoothed data. For many WT fixed embryos, per-computed smoothed versions of the image, as well as images showing the x- and y- gradients in fluorescent intensity, are available in sub-folders entitled "sigma050_step001" ect. Here sigma is the standard deviation of the Gaussian used for smoothing, and step the step size used to compute the gradients. This information was computed for use in the paper "Geometric control of Myosin-II orientation during axis elongation" by Lefebvre & Claussen et al., https://www.biorxiv.org/content/10.1101/2022.01.12.476069v1.
Datasets - special cases
In addition to recordings of different fly genotypes during early development, the dataset also includes a movie of a nuclear marker that covers the entire duration of embryo development (WT/H2a-mCherry_Klarsicht/201802181715), and data from opto-genetic experiments (in the genotype folder "opto-RhoGEF2"). In these experiments, the embryo was not simply observed, but manipulated using optogenetic activation, as described in the manuscript "Patterned mechanical feedback establishes a global myosin gradient" by Gustavson et al. (DOI: 10.1038/s41467-022-34518-9).
Timing and cartography metadata
The images and movies contained in the atlas dataset were created from 3-dimensional data using tissue cartography ("Tissue cartography: compressing bio-image data by dimensional reduction", Heemskerk & Streichan 2015, DOI: 10.1038/nmeth.3648). The information defining the map from the cylindrical chart/pullback in which the movies/images of the dataset are displayed and the 3d shape of the fly embryo are contained in the folder "embryo_geometry". Here, there are different .ply 3d meshes of the embryo surfaces, .mat data files and sample MATLAB scripts which can be used for basic tissue cartography operations. Details are described in a separate README contained in the "embryo_geometry" folder.
The folder "timing" contains internal files used by the time-stamping algorithm described in the manuscript.z
Changes made since previously published version: There are three changes to the dataset: 1) added a .csv file listing all recordings available in the dataset, 2) added a python tutorial (jupyter notebook) on how to query the dataset and load data, 3) small renaming changes to fix spelling mistakes in folder names.
Methods
Dynamic atlas of early Drosophila development, collected using in-toto lightsheet microscopy and tissue cartography.
Usage notes
Program to view multi-page .tif files, e.g. Fiji. Spreadsheet program to view list of datasets. For the Python tutorial, a working python+jupyter installation (e.g. via anaconda).