HEAL Data Dictionary Preparation Guidance


This resource offers guidance for preparing variable-level metadata (often in the form of a data dictionary) to support clarity, consistency, machine processing, reuse, and alignment with the variable-level metadata schema used by the HEAL Data Ecosystem. Variable-level metadata (VLMD) is a core component of a complete HEAL data package and, along with key supporting documentation, helps others understand how your research defines, measures, and encodes variables for reuse and analysis. Following these practices can also help ensure your file is ready for use with the Platform’s VLMD tool, enabling extraction of HEAL-compliant VLMD and validation against the HEAL VLMD schema.

Overview


Preparing Your Data Dictionary

This guidance is organized into four focus areas: variable clarity and reusability, consistency across variables, structure for programmatic use, and data integrity and source fidelity. Additional submission guidance and resources can be found in the Data Dictionary Sharing Readiness Tab. Each section outlines key elements supporting high-quality, reusable VLMD. This resource supports data dictionary preparation and helps reduce submission delays; it is not a rigid pass–fail checklist, and not all elements will apply to every study or data type.