Data Packaging Examples and Best Practices


These examples are from HEAL-funded studies that have submitted data to a HEAL-compliant repository. The datasets are publicly accessible, and the Principal Investigators have given the HEAL Stewards permission to link to their data packages. Some data types below do not have data package examples available yet. Examples will be added as they become available. In the meantime, general data sharing guidance materials are provided to help investigators prepare their data packages. While reviewing the examples below, look for the symbols that identify which core (✅) and additional (✔️) components each data package includes.

 

Animal Behavioral, lmaging, and Observational Data


Data Package example:

  • Study title: Temporal Patterns of Spinal Cord Stimulation
  • Data Package Link (SPARC)
Why this is a good example:

This data package has a clear folder structure and file naming convention that makes it easy to find the data files and understand their relationships. The data package uses repository-specific (SPARC) requirements, documenting the study protocol using Protocols.io.

Components in this Data Package:

✅ Data files
✅ README or Summary file
✅ Variable-level Metadata documentation
✅ Repository-specific documentation
✔️ Code used to conduct analyses
✔️ Study Protocol

Data Package example:

Why this is a good example:

This dataset contains EEG and EMG recordings conducted in non-human primates. In addition to all the core components, this data package includes study methodology and links to the software used in analysis.

Components in this Data Package:

✅ Data files
✅ README or Summary file
✅ Variable-level Metadata documentation
✅ Repository-specific documentation
✔️ Code used to conduct analyses
✔️ Publication Citations
✔️ Context or explanatory documents

EEG-Related Resources:

Best Practices:

For non-human subjects data, such as video recordings of mice behavior, data repository options may be limited by large file sizes. Audiovisual recordings involving animals may raise ethical, regulatory, or programmatic considerations that extend beyond standard identifiability concerns. Research teams should carefully assess these factors when determining what to share and how to document the materials. Researchers may choose to share processed data from audiovisual recordings in addition to or rather than the raw recordings.

Sharing data behind access controls is an appropriate safeguard to manage sensitive data concerns.