Summary
Objective: The current observational research literature shows extensive publication bias and
contradiction. The Observational Health Data Sciences and Informatics (OHDSI) initiative
seeks to improve research reproducibility through open science.
Methods: OHDSI has created an international federated data source of electronic health records
and administrative claims that covers nearly 10% of the world’s population. Using
a common data model with a practical schema and extensive vocabulary mappings, data
from around the world follow the identical format. OHDSI’s research methods emphasize
reproducibility, with a large-scale approach to addressing confounding using propensity
score adjustment with extensive diagnostics; negative and positive control hypotheses
to test for residual systematic error; a variety of data sources to assess consistency
and generalizability; a completely open approach including protocol, software, models,
parameters, and raw results so that studies can be externally verified; and the study
of many hypotheses in parallel so that the operating characteristics of the methods
can be assessed.
Results: OHDSI has already produced findings in areas like hypertension treatment that are
being incorporated into practice, and it has produced rigorous studies of COVID-19
that have aided government agencies in their treatment decisions, that have characterized
the disease extensively, that have estimated the comparative effects of treatments,
and that the predict likelihood of advancing to serious complications.
Conclusions: OHDSI practices open science and incorporates a series of methods to address reproducibility.
It has produced important results in several areas, including hypertension therapy
and COVID-19 research.
Keywords
Observational research - reproducibility