Metadata Resources


The resources below provide an introduction to metadata and different metadata annotation levels, and include explanations of why researchers should annotate their studies at each level.

Variable Standards Finder Tool

Metadata in the HEAL Data Ecosystem


The NIH defines metadata as "Data that provide additional information intended to make scientific data interpretable and reusable (e.g., date, independent sample and variable construction and description, methodology, data provenance, data transformations, any intermediate or descriptive observational variables)."

The HEAL Data Ecosystem is powered by metadata; information describing data. By sharing study metadata, investigators make their data more Findable, Accessible, Interoperable, and Reusable (FAIR). SLMD, VLMD, and CDE information supports the HEAL Data Platform and HEAL Semantic Search, promoting research discovery. For a brief overview on the importance of data sharing in the HEAL Data Ecosystem, see the Fresh FAIR Webinar Recap: Advancing Open Science with the HEAL Data Ecosystem.

The HEAL Data Ecosystem incorporates three types of research study metadata:

  • Study-level metadata (SLMD): describes a HEAL-funded study.
  • Variable-level metadata (VLMD): describes the variables collected by a study.
  • Common data elements (CDEs) usage information: schemas that contain standardized variable descriptions, collection protocols, question text, units, and permissible values. Using the same CDEs can enable data collected by different studies to be harmonized. Studies are encouraged (and, in some cases, required) to use HEAL CDEs.