Submission requirements
Accepted data
Dryad accepts all research data and is intended for complete, re-usable, open research datasets.
- Dryad does not accept any files with licensing terms that are incompatible with the Creative Commons Zero waiver. For more information, please see Good data practices: Removing barriers to data reuse with CC0 licensing.
- Dryad does not accept data submissions containing personally identifiable information (PII). Any data involving human subjects must adhere to IRB regulations, obtain formal consent from participants for sharing, be properly anonymized, and be prepared in accordance with legal and ethical guidelines before being considered for publication. Please see additional guidance on human subjects data. Additionally, due to the potential risk for indirect re-identification of research participants, Dryad does not accept transcripts from interviews, focus groups, observation studies, or images, audio, or video recordings derived from or displaying human subjects. Properly de-identified micro-level or aggregated quantitative data and summarized qualitative data derived from human participant research submissions are acceptable pending curator analysis if the requirements mentioned above are met.
- Dryad will host code, scripts, software, and/or supplemental materials. Because data files are not always compatible with the CC0 license waiver required for publication, you will have the option to upload files via Dryad for hosting on Zenodo, which allows public software deposits with version control for the ongoing maintenance of software packages and additional licensing options for files uploaded as 'Software' or 'Supplemental information'. All files selected for upload and hosting by Zenodo will be time-released with the publication of the Dryad dataset and remain linked and accessible through the Dryad DOI.
Preferred file formats
Use CSV, TSV, or ODF formats for tabular data. Excel files (XLS or XLSX) with formatting can reduce accessibility and complicate downstream analysis. If Excel is necessary, remove any non-essential elements such as additional pages, merged cells, highlighting, embedded figures, frozen panes, comments, hyperlinks, formulas, and filters. If any of these features are essential for understanding the data, explain them clearly in the accompanying README file.
Dryad welcomes the submission of data in multiple formats to enable various reuse scenarios. For best practice, always submit a clean, unformatted version of the dataset in CSV, TSV, or ODF format alongside any formatted Excel files.
Most types of files can be submitted (e.g., text, spreadsheets, video, photographs, code) including compressed archives of multiple files. Preservation-friendly, open formats for files include:
- Text:
- README files should be in Markdown (
MD
) - Comma- or tab-separated values (
CSV, TSV
) for tabular data - Semi-structured plain text formats for non-tabular data (e.g., protein sequences)
- Structured plain text (
XML, JSON
)
- README files should be in Markdown (
- Images:
PDF, JPEG, PNG, TIFF, SVG
- Audio:
FLAC, AIFF, WAV, MP3, OGG
- Video:
AVI, MPEG, MP4
- Compressed file archive:
TAR.GZ, 7Z, ZIP
Note: RAR (Roshal ARchive) is a proprietary compression format. Because users may not have access to the necessary tools to open RAR files, we cannot accept them for publication. Please use open, widely supported, and easily accessible formats like those listed above.
File size
We recommend that individual files should not exceed 10GB. This ensures files are easily accessed and downloaded by Dryad users.
There is a limit of 300GB per data publication uploaded through the web interface.
Metadata requirements
Good metadata helps make a dataset more discoverable and reusable. The metadata should describe the data themselves, rather than the study hypotheses, results, motivations, or conclusions. A thorough description of the data file, the context in which the data were collected, the measurements that were made, and the quality of the data are all important.
We require:
- Journal name: If associated with a manuscript, fields for journal name and manuscript number are required; if associated with a published or in-press article, fields for journal name and DOI are required.
- Title: The title should be a succinct summary of both the data and study subject or focus. A good title typically contains 8 to 10 words that adequately describe the content of the dataset.
- Author(s): Name, email address, primary institutional affiliation of main researcher(s) involved in producing the data.
- Affiliations are drawn from the Research Organization Registry (ROR)
- If you provide your co-authors' email addresses, when the dataset is published, they will receive a message giving them the option to add their ORCID to the Dryad record
- Abstract: Brief summary of the dataset’s structure and concepts including information regarding values, contents of the dataset, reuse potential and any legal or ethical considerations. If this dataset is associated with a study, abstract language can be similar, but it should focus on the information relevant to the data itself, rather than to the study.
- Research domain: Primary research domain. Domains are drawn from the OECD Fields of Science and Technology classification.
- Keyword(s): Descriptive words that may help others discover your dataset. We require a minimum of 3 keywords for submission. Keywords can match those from your related research article or, if available, you can select from a controlled vocabulary for your discipline.
We recommend:
- Funding Information: Name of the funding organization that supported the creation of the resource, including applicable grant number(s). Each grant and associated award number should be input separately. Options in the drop-down menu are populated from the Research Organization Registry (ROR).
- Research facility: Where the research was conducted, if different from your current affiliation (e.g., a field station). Options in the drop-down menu are populated from the Research Organization Registry (ROR).
- Methods: Any methodological information that may help others to understand how the data were generated (i.e. equipment/tools/reagents used, or procedures followed).
- Related works: Use this field to indicate resources, other than the primary article, that are associated with the data. Examples include related datasets, preprints, etc.