Keywords
Public Health - Preventive Health - Social Determinants of Health - Population Health
1. Introduction
In the years following the complete sequencing of the human genome, much attention
was given to the idea of personalized medicine and related efforts— genomic medicine,
P4 medicine (predictive, preventive, personalized, and participatory), and later,
precision medicine [[1]
[2]
[3]
[4]]. The latter term was favored following the announcement in President Obama's 2015
State of the Union address of a new federal “Precision Medicine Initiative,” since
rebranded as the “All of Us” project [[4],[5]]. Even from the early days of the precision medicine era, despite the frequently
used picture of President Obama next to a molecular model of DNA [[6]], it was recognized that DNA is only a small part of the picture— environment and
lifestyle are critical components as well [[7]].
To understand what is meant by “precision prevention”, it may be useful to examine
this term in the context of related terms, as well as some axes that they connote.
Key related terms include precision medicine, precision population health, and precision public health, as well as public health and population health without the qualifier. It should be noted that these terms are not necessarily used
consistently, nor does everyone agree on specific definitions or nuanced differences.
However, different combinations of these terms may connote certain nuances including
target audience (individuals, groups, or entire populations), level of well-being
(from ill to thriving), type of intervention (primary prevention vs. secondary or tertiary prevention vs. treatment), level of risk (from population average to someone experiencing an adverse
event or outcome), and responsible actor (e.g., clinical care provider vs. governmental policy makers) (see Box 1).
Box 1: Axes of distinction for precision prevention and related terms
How many - all people → group → individual
Presence of good vs. absence of bad - thriving → not sick → sick
Level of risk - entire population → at risk individuals → those actually experiencing an adverse event or outcome
Intervention - 1, 2, 3 prevention → treatment
Actors - individual → clinical care provider → social service provider → public health practitioner → government
1.1 Public Health vs. Population Health
The National Academy of Medicine has defined public health as “what we as a society do collectively to assure the conditions in which people can
be healthy” [[8]]. The CDC has defined population health, in contrast, as “an opportunity for health care systems, agencies and organizations to work together
in order to improve the health outcomes of the communities they serve” [[9]]. The distinction was summed up by the Massachusetts Public Health Commissioner,
Dr. Monica Bharel, in a 2020 interview by saying, “Public health is about what we're doing as a society, and population health is about
what a system is doing for their community” [[10]]. However, Dr. Bharel goes on to point out that the distinction is largely artificial
and, in fact, does a disservice to patients and communities by perpetuating siloed
services and solutions [[10]]. In either case, the focus is on groups of people— whether a specific population
or an entire society—and while the work spans prevention to treatment (healthcare
systems are often not involved until there is an issue to be treated), there is an
emphasis on prevention, particularly in the context of value-based care. Beyond simply
prevention of disease, both definitions incorporate aspects of health promotion. With
the aspiration that people “be healthy” and have improved “health outcomes”, public
and population health aim not only to promote the absence of illness but an enhanced
state of well-being [[11]].
1.2 Precisely what do we mean by Precision?
The addition of the “precision” modifier tends to connote two additional aspects:
first, the use of more and more varied types of data to inform the practice, whether
medicine, public health, or population health.[[12],[13]] And second, the targeting of the practice to a more specific audience, whether
an individual or a group.[[4],[13]] Precision medicine shares with personalized medicine, stratified medicine, and even genomic medicine the use of greater amounts of data (clinical, physiologic, genomic, lifestyle, and
environment) to guide the right intervention for the right person (or group of people)
at the right time. In the same way, precision prevention (and precision public and population health) attempts to use data to refine traditional
prevention and health promotion approaches to better target specific groups within
a community. Precision can refer to an individual, but it may also refer to groups of people. At the extreme,
it can be thought of as anything short of the entire population. The terms “precision
medicine” and “personalized medicine” were meant to be juxtaposed with the term “one
size fits all” medicine, suggesting that medical treatment must be tailored to the
individual and not merely following a single guideline. Of course, clinicians will
rightfully point out that medicine has always been personalized in practice.[[14]]
1.3 All together now: defining Precision Prevention
The term precision prevention was first coined in the context of cancer and defined as involving the “use of biological, behavioral, socioeconomic, and epidemiologic data” [[15]] to reduce cancer incidence and mortality. Later, a distinction was made between
precision treatment and precision prevention, with the latter applied “‘above the skin’ to overcome psychosocial barriers, emphasize achievable goals, or
adapt to families' differing economic or cultural circumstances” [[16]]. Without actually using the term “precision prevention”, Pearson et al. explore all relevant issues, and pose a vision for precision public health as “the right intervention at the right intensity in the right community at the right
time” [[17]]. We therefore propose the following definition for precision prevention in the
context of this survey paper: “the use of biologic, behavioral, socioeconomic, and
epidemiologic data to inform the right intervention at the right intensity in the
right community at the right time, in order to prevent or reduce illness and improve
health”.
As illustrated in [Figure 1], it is estimated that 80% of health is driven by a combination of social and economic
factors (e.g., income and employment), health behaviors (e.g., diet and exercise), and physical environment (e.g., housing and transportation) [[18],[19]]. Consequently, access to high quality clinical care is necessary but not sufficient
for good health. The combination of non-clinical drivers of health are often referred
to as social determinants of health (SDOH) in the context of a group or community
and can have both positive and negative influences on health. In contrast, health
related social needs (HRSNs) are more specific when referring to a specific family
or an individual [[20],[21]]. To ensure every individual has the opportunity for health, it is essential to
tackle all of the influencing factors.
Figure 1: Relative contribution of socioeconomic factors, physical environment, health behaviors,
and health care toward the health of an individual. (Figure source: https://coveragetoolkit.org/health-equity-and-the-national-dpp/defining-health-equity). Prepared under Cooperative Agreement Number 5NU38OT000225-04, funded by the Centers
for Disease Control and Prevention).
In November 2023, the U.S. Office of Science and Technology Policy released a “Playbook”
to address SDOH and HRSNs [[20]]. This report describes an “ecosystem” of factors that impact health, as depicted
in [Figure 2]. The bottom third of the figure represents individuals and their healthcare. The
top two thirds of the figure— community conditions, environment, and individuals'
social needs— are the primary domain of both precision prevention and precision health
promotion.
Figure 2: The Social Determinants of Health (SDOH) Ecosystem, as depicted in [[22]].
Note that while social needs were not previously considered “targetable” for prevention,
recent developments in SDOH referral (and reimbursement) are in fact enabling intervention
in these areas, e.g., housing, transportation to help prevent (or mitigate) disease.
Additionally, community conditions and environment, while not generally actionable
per se, can help identify targeted populations and communities for outreach and services.
The precision prevention framework, and prevention in general, can be applied using
a tiered risk model: 1) the full population, 2) higher risk or “vulnerable” populations,
and 3) populations already experiencing adverse outcomes [[23],[24]]. In this way, precision prevention may be prophylactic, aimed at preventing disease
in the first place, or mitigating, aimed at reducing the severity or frequency of
adverse events for afflicted individuals. Both across population risk tiers, and through
the upstream, midstream, and downstream domains depicted in [Figure 2], novel and innovative collection, exchange, and use of data is required.
For this reason, it is not surprising that the first of three “pillars” of action
recommended in the SDOH Playbook [[20]] is to “Expand Data Gathering and Sharing.” Precision prevention requires innovation of data infrastructure and capabilities
at all levels. First, data relating to SDOH and social needs must be defined. Once
the appropriate data elements are identified, they must be collected in a standardized
way. Finally, data analytics and decision support can be applied across tiers to inform
interventions tailored to individuals and populations. The sections below delve into
each of these three areas.
1.4 A few Words on Scope
Note that the definition above includes biologic and epidemiological data. However,
full coverage of all of these data types would require far more space than this survey
permits. Fortunately, the collection and use of biologic and epidemiological data
are already well covered in the precision medicine and public health literature. In
this survey, we emphasize behavioral and socioeconomic data as they are the novel
aspects of the combined concepts of precision and prevention.
It should also be noted that while data are a primary focus in the field of informatics,
they represent just one aspect of current barriers to the realization of precision
prevention. Policy issues, misaligned financial incentives, evidence gaps, skills
gaps, siloed actors, and subconscious bias all represent considerable obstacles to
progress in this space. However, in the context of the International Medical Informatics
Association's Yearbook of Medical Informatics, we focus on data-related challenges and achievements.
2. Methods
The concept of “Precision Prevention” in the context of public health is novel enough
that a search for (“precision prevention” AND “public health”) in PubMed in the final
days of 2023 returns only ~100 hits, only 19 of which were published in 2023, 26 in
2022. Many of those articles focused on topics that fall under the more commonly used
term “precision medicine.” Conversely, many relevant publications do not actually
use the term “precision prevention.” In addition, many of the notable data-related
developments in this space over the past 1-2 years have been outside of peer reviewed
literature, including federal regulations, funded initiatives, and health information
technology (IT) trends. This emerging topic is thus better suited for a narrative
review approach than for a systematic review.
The authors first performed searches in PubMed and Google Scholar on variations of
related terms, e.g., “precision population public health”, “(data OR informatics) AND SDOH”, “precision
public health data”. An outline was developed using a combination of themes from the
resulting articles and the authors' perspectives and expertise. Sections were assigned
based on respective expertise. The Gravity section was written based entirely on author
(EG) expertise. Each other section was written using a combination of prior knowledge
and relevant search terms for that portion of the literature, e.g., “SDOH AND mHealth”, “SDOH AND EHR”. The sections were then combined and edited for
cohesion and space limitations.
3. Social Needs Data
We cannot improve what we do not measure. Just as there are several factors that can
affect a person's health risks, there are several data domains that become relevant
for a precision prevention approach. For precision prevention, we first need to consider
what data is important to gather at the individual level that incorporates social
needs, preferences, and person-centered goals [[25]].
Geospatial information can give clues to a person's relative economic resources, proximity
to health providers or grocery stores, or exposure to pollution or contaminants. Demographic
data, including age, race, ethnicity, language, sexual orientation, and gender identity,
can provide context for a person's cultural experience, or societal biases they may
encounter on a regular basis. Socioeconomic data, along with insurance status, similarly
give clues regarding multiple risk factors, including financial strain and access
to care. And disability status can illuminate other ways in which a person might be
part of a community with additional challenges. Some of these elements (e.g., race, ethnicity, preferred language, sex assigned at birth) were included as demographic
data elements in the first version of the U.S. Core Data for Interoperability (USCDI—
see below for more information) [[26]]. Others, such as gender identity, sexual orientation, and socioeconomic factors
were only added in USCDI Version 2. Disability status was added in Version 3, but
some have noted concern with the fact that it falls under health status rather than
demographics. They also note a lack of standards to use for this data element [[27]].
In addition to the types above, the past decade or so has seen the collection and
use of SDOH-specific data types: assessments, problems, goals, and interventions.
SDOH assessment data may be collected explicitly through a health questionnaire, coded
using LOINC, or collected implicitly through patient encounter notes. SDOH intervention
data might take the form of referrals for services documented in an electronic health
record (EHR) or related clinical IT system. Referrals may be thought of as a “prescription”
for services to address a social need, e.g., housing instability or food insecurity. Some of the more advanced tools in use today
are “closed loop referral systems (CLRS)”, where not only is the referral (the prescription)
captured, but also the status and resolution. For example, instead of providing a
person with a list of food banks near their location, a referral is sent to one or
more food-related community-based organizations (CBOs) which are also authenticated
users of the referral system. Those CBOs may then accept or reject the referral based
on their capacity (i.e., if they have the necessary resources and if the person in need meets their criteria
for services, whether by age, veteran status, etc.) and then update the status once
the service has been received. North Carolina's NCCARE360 [[28]] is an example of a CLRS, highlighted as “Innovation in Action” in the SDOH Playbook
[[20]].
3.1. Clinical Screening Data
Assessment of adverse SDOH is increasingly recognized as an important aspect affecting
care delivery and healthcare outcomes [[29]
[30]
[31]
[32]]. Unfortunately, these risk factors are not reliably captured in the EHR [[33]
[34]
[35]]. Where data quality issues are observed, they often further exacerbate demographic
disparities, with data missing “not at random” between different racial and ethnic
groups [[36],[37]]. While screening for SDOH measures is important, the process can be time-consuming
and resource-intensive in healthcare settings, adding complexity to already overloaded
clinical encounters [[38],[39]]. To add to the challenge, SDOH domains have not been reliably captured within structured
data elements in the EHR or in standardized formats that promote interoperability
and digital sharing [[40]].
The lack of standardization in how SDOH data is collected in clinical IT systems is
evident with the plethora of screening tools administered and the variety of data
standards applied to document and exchange this information across disparate systems
and networks. There are multiple validated SDOH screening questionnaires for diverse
populations, only some of which have been mapped to existing data standards [[41]
[42]
[43]]. Technology for digital screening includes health portals and tablets, but it is
not universally available in clinical care. If it is not available, then more traditional
paper screenings or in-person verbal formats are required. EHR vendors may have specific
SDOH data capture options but also allow for local configuration, adding to the lack
of standardization [[33]]. For unstructured SDOH documentation, translation into usable data requires additional
processing outside of clinical care using natural language processing (NLP) [[44]
[45]
[46]].
Direct digital formats for patient specific SDOH screening are preferable for their
efficiency in data capture. In the context of a clinical encounter, there are two
primary options: health portal questionnaires or tablet-based surveys. Health portal
questionnaires can be sent out to the patient before a visit. The use of health portals
among at-risk populations is lower than in other groups. Thus, the capture of SDOH
data through these portals may be lower [[47]]. At the health system in which one co-author is employed, even with more than 75%
of patients with a health portal account, completion of the SDOH questionnaire hovers
around 30%. With national health portal usage statistics around 60%, this format can
only be responsible for a minority of the total collection [[48]]. In-person tablet surveys can help in SDOH data collection but require technology
resources not available in many clinical care environments.
While many large providers and health plans have adopted the above technologies in
their workflow, for many patients the most common format for screening is a paper
survey at the time of a clinical encounter. The most significant limitation of this
format is that it still requires a healthcare team member to translate responses into
the patient's chart. Additionally, when the survey is administered verbally by the
health provider, the potential emotional vulnerability and the time it takes to complete
(locally 3-5 minutes) make this option the easiest to omit in a time-limited visit
[[49]].
For patients who have not been screened systematically, unstructured free-text narratives
in clinical notes in the EHR are an additional source of SDOH data. This method has
the benefit of being more specific to the patient's circumstance but is not accessible
discretely without additional NLP processing or labor-intensive manual review [[33]]. While there are many examples of research extracting individual-level SDOH domains
from clinical notes, most of these examples are limited to a few domains and are not
immediately available for clinical care [[46],[50],[51]]. International Classification of Disease (ICD), Version 10, Clinical Modification
(CM) “Z codes,” used in the U.S., offer an additional option to capture adverse SDOH
needs but are only used in a small minority of clinical encounters [[52],[53]]. Similar to clinical documentation narratives, this format requires clinician effort
and is relatively nonspecific. In some cases, when patient-specific data is not available,
or when a population approach is used, the Area Deprivation Index (ADI) or Social
Vulnerability Index (SVI) can be used to infer SDOH status [[54]
[55]
[56]].
The importance of cultural competence. Direct patient screening can pose challenges
as screening for social needs can be a sensitive and potentially stigmatizing experience,
requiring empathy and sensitivity to collect most effectively. One benefit of technology-mediated
surveys is that they can be self-administered. They produce less emotional vulnerability
without specific human interaction. If SDOH screening is performed without appropriate
sensitivity and cultural competence, the therapeutic relationship may be compromised.
Moreover, if screening is performed without ready intervention, additional harm may
be done [[57]]. To avoid these adverse outcomes, best practices for screening and referral include:
use of screening tools that are appropriate in terms of culture, language, and literacy;
integration of screening into existing clinical workflows; universal screening; and
a solid referral base [[58]]. When screening is completed, however it is done, the clinical team needs to recognize
the results and intervene on behalf of the patient.
3.2. mHealth
Mobile health (mHealth) technologies and applications are shifting how health data
is collected and utilized. Traditional health data collection methods, often centered
on clinical factors, fail to capture the nonclinical aspects that significantly influence
healthcare outcomes fully. In contrast, mHealth technologies can help introduce a
broader, more holistic approach to this issue [[59]]. These technologies enable real-time monitoring and gathering of health-related
data in everyday settings. As a result, this approach can offer a more comprehensive
view of patient health beyond clinical environments. By integrating nonclinical factors
into health data collection and deepening our understanding of patient health within
daily life contexts, we can create healthcare solutions more closely tailored to the
distinct health needs of individuals and communities.
Utilizing mHealth data is becoming increasingly important for healthcare professionals,
researchers, and patients [[60]]. This data can help patients with limited access to clinical services and bridge
information gaps during clinical encounters. A particularly significant and promising
aspect of mHealth data is its potential to meet the healthcare needs of historically
underserved communities [[61]]. mHealth apps enable users to track and manage symptoms of less-researched chronic
diseases [[62]]. For example, an app called Phendo[1] helps facilitate personalized monitoring and generate valuable patient-reported
data in the context of endometriosis. The data captured in this app is vital as it
enables individuals to comprehend their health conditions better and communicate more
effectively with healthcare providers, fostering the development of customized treatment
plans [[62]].
mHealth data is also instrumental in managing mental health issues, such as anxiety
and depression, especially for marginalized groups like Black women. Black women frequently
face a higher rate of adverse health outcomes related to these conditions [[63],[64]]. Due to systemic barriers restricting their access to standard healthcare services,
Black women are not always able to appropriately address these issues. Culturally
appropriate mHealth applications can help address this problem by providing Black
women with resources tailored to their specific health needs and preferences, thereby
increasing their involvement in managing their mental health effectively [[63]]. This shift towards leveraging mHealth data is central to advancing precision prevention.
It allows for creating interventions that are more accurate, culturally sensitive,
and effective, considering the distinct experiences and needs of various groups. This
approach marks a significant step towards making healthcare more equitable and accessible,
thus ensuring that all individuals, regardless of background, can access necessary
healthcare services.
3.3. Social Media
Social media emerges as a valuable medium for capturing real-time health experiences
of individuals and communities. This kind of data is vital because it can bridge the
gaps in health information that might be missing in EHRs or insufficiently discussed
and documented during medical visits. Platforms like X (Twitter), Meta (Facebook),
Instagram, TikTok, and others serve as platforms where people frequently discuss their
health challenges, experiences, and motivations. Social media provides insights into
the lived experiences of individuals, particularly in how HRSNs manifest and influence
healthcare access and understanding. Notably, platforms like TikTok offer a rich source
of video content where users openly share their experiences to foster awareness and
peer support and normalize conversations around chronic illnesses. Such data is instrumental
in understanding stigmatized topics and informs the development of online health communities
tailored to the needs of adolescents and young adults [[65]]. Similarly, platforms like Meta, Instagram, and Tumblr provide multimodal data
that sheds light on the health experiences of diverse groups, including fat[2], LGBTQ+, and disabled individuals, in both clinical and non-clinical settings. The
experiences of these communities, especially those with intersecting marginalized
identities, can significantly differ from those of cis-heteronormative, straight-sized
(people who are not fat [[66]]), and non-disabled individuals. For example, social media data can reveal the impacts
of issues like fatphobia, particularly in online discussions about anti-fatness among
individuals with eating disorders [[66]]. These platforms are crucial for building online communities, enhancing health
understanding, and developing health literacy among such groups. Thus, utilizing social
media data is critical to crafting early interventions and addressing major health
concerns, considering community input and the unique experiences shared on these platforms.
This approach can lead to more effective, tailored preventive health strategies that
acknowledge and address the diverse needs and challenges of various groups within
the population.
4. Data Standardization and Structure
4. Data Standardization and Structure
In the section above, we identify different types of data that may be relevant to
a precision prevention approach. For social media and mHealth data, storage and exchange
tend to be vendor-based. In some cases, application programming interfaces (APIs)
enable data access to social media data [[67]]. Researchers have made progress leveraging open mHealth standards for the integration
of person-generated data (PGD) into the EHR, but we are still in the early days [[68],[69]]. An area where significant progress has been made toward data standardization is
in SDOH data in the clinical context. Central to this work is a massive collaboration
known as the HL7 Gravity Project.
4.1. The HL7® Gravity Project
Over the past decade, growing investments in non-clinical interventions [[70],[71]], coupled with the growing number of national “calls to action” [[72]
[73]
[74]] for health systems to play a more prominent role in fostering whole-person care,
have spurred new policies and programs for collecting and using SDOH data in clinical
settings. The ability to link clinical and non-clinical data depends on using agreed-upon
terms and data standards across the technologies that collect, use, and exchange this
information. However, as described above, this standardization has been lacking.
A series of federal rule-making and related programs, beginning with the Health Information
Technology for Economic and Clinical Health (HITECH) Act in 2009 and the 21st Century Cures Act in 2016, have propelled standard-based approaches to support patient-level
electronic data collection and exchange [[75]]. The HITECH programs incentivized the use of a common set of medical terminologies
and codes to represent concepts such as race, ethnicity, and preferred language but
were limited in requirements on other concepts for social risk assessments, diagnosis,
goals, and interventions.
In 2019, following a series of multi-stakeholder discussions with experts in SDOH
data from healthcare, community health, government, academia, digital health, and
health informatics, the Social Interventions Research and Evaluation Network (SIREN),
with funding from the Robert Wood Johnson Foundation and in partnership with EMI Advisors,
launched the Gravity Project. The Gravity Project is an open multi-stakeholder public
collaborative and HL7 Fast Healthcare Interoperability Resource (FHIR®) accelerator of over 3,000 participants focused on developing, testing, and publishing
national SDOH data standards for use in digital technologies including EHRs [[76]]. The data standards support both structural-level interoperability [[77]] and semantic-level interoperability. The structural standards incorporate the HL7
FHIR specification, while the semantic standards are specified by the type of clinical
activity the SDOH data concept represents. The coded vocabularies and value sets used
to represent the SDOH concepts are enumerated in [Table 1]. SDOH-related domains and the respective terminologies designated through the Gravity
Project..
Table 1.
SDOH-related domains and the respective terminologies designated through the Gravity
Project.
Use
|
Standard
|
Screening/assessment question/answer pairs
|
LOINC®
|
Observations
|
SNOMED CT®
|
Diagnoses
|
ICD-10
|
Goal setting
|
LOINC®
|
Interventions
|
SNOMED CT®
|
Billing
|
CPT®/ HCPCS (Current Procedural Terminology / Healthcare Common Procedure Coding System)
|
The Gravity Project further categorizes SDOH interventions into nine types, as presented
in [Table 2]. SDOH intervention types as defined through the Gravity Project..
Table 2.
SDOH intervention types as defined through the Gravity Project.
Gravity Term
|
Gravity Definition
|
Example
|
Adjustment
|
Adjustment of clinical plan to accommodate social risk
|
Adjustment of clinical plan to accommodate social risk. SNOMED CT 1269404007
|
Assistance/Assisting
|
To give support or aid to; help
|
Assistance with application for food pantry program. SNOMED CT 467771000124109
|
Coordination
|
Process of deliberately organizing activities and sharing information to achieve safer
and more effective care aligned with patient preferences
|
Coordination of resources to address food insecurity. SNOMED CT 1004110005
|
Counseling
|
Psychosocial procedure that involves listening, reflecting, etc., to facilitate recognition
of course of action/solution
|
Counseling for food insecurity care plan participation barriers. SNOMED CT 464681000124109
|
Education
|
Procedure that is synonymous with those activities such as teaching, demonstration,
instruction, explanation, and advice that aim to increase knowledge and skills, change
behaviors, assist coping and increase adherence to treatment
|
Education about Senior Farmers' Market Nutrition Program. SNOMED CT 464341000124108
|
Evaluation of eligibility (for X)
|
Process of determining eligibility by evaluating evidence
|
Evaluation of eligibility for Senior Farmers' Market Nutrition Program. SNOMED CT
464651000124101
|
Evaluation/ Assessment
|
Determination of a value, conclusion, or inference by evaluating evidence
|
Assessment for food insecurity. SNOMED CT 1002224003
|
Provision
|
To supply/make available for use
|
Provision of food voucher. SNOMED CT 464411000124104
|
Referral
|
The act of directing someone to a different place or person for information, help,
or action
|
Referral to food prescription program. SNOMED CT 464061000124105
|
As of early 2024, the Gravity Project has published value sets in the National Library
of Medicine (NLM) Value Set Authority Center (VSAC)for 19 SDOH domains [[78]], including food insecurity, housing instability, homelessness, and transportation
insecurity. Value sets are lists of specific values (terms and their codes) that define
clinical concepts derived from standard vocabularies to support effective health information
exchange. The Gravity Project defined value sets establish boundaries around what
terms are allowed for a documented SDOH condition or what terms serve as a denominator
for an SDOH quality measure.
In 2021, the Gravity Project SDOH value sets were incorporated into the Office of
the National Coordinator for Health IT (ONC) USCDI Version 2 [[26]]. The USCDI is a standardized set of health data classes and constituent data elements
for nationwide, interoperable health information exchange. As noted in the ONC 2022
Report to Congress [[79]], USCDI is effectively the minimum dataset of a health system. It serves as the
floor of what data are available via the health IT systems of most hospitals and doctors'
offices and the major consumer health apps that give consumers online access to electronic
health information. In addition to the core data set within USCDI, ONC also supports
an extensible data set, USCDI+, to meet specific programmatic and/or use cases required
by federal and industry partners, particularly in the areas of public health, care
quality, cancer, behavioral health, and maternal health [[80]].
The first version of the USCDI was adopted as a standard in the ONC Cures Act Final
Rule [[81]] and thereby made a requirement for certified health IT developers to support in
their products. The ONC Cures Act Final Rule also includes policies promoting the
use of secure, standard-based APIs that encourage the development of health apps that
provide access to and use of data in EHRs to better support person-centered care and
patient empowerment. In December 2023, ONC published the Health Data, Technology,
and Interoperability: Certification Program Updates, Algorithm Transparency, and Information
Sharing (HTI-1) Final Rule [[82]], which makes updates to the ONC Health IT Certification Program with new standards.
The final rule specifically raises the baseline version of the USCDI from Version
1 to Version 3, thereby making it a requirement for certified health IT developers
to capture and exchange the USCDI V3 data elements, including SDOH elements and the
Gravity Project's relevant value sets, by January 1, 2026. USCDI, by default, is agnostic
to the data model and transport mechanisms adopted to exchange electronic information.
USCDI provides a common language and meaning to the information that can be collected
and exchanged using common data models like FHIR and OMOP (Observational Medical Outcomes
Partnership).
4.2. The Need for Data Granularity
The work of the Gravity Project and associated data classes included in the USCDI
underscore the importance of specifying agreed-upon standards to represent key concepts
for documenting individual social risks and social needs in electronic systems. Although
distinct coded terminologies are used to differentiate between a social risk screening
activity (documented in a LOINC® code) and a clinical diagnosis (documented as an
ICD-10 Z code), these concepts are grouped as value sets published in the NLM VSAC.
Most recently, the Gravity Project published a crosswalk of coded concepts for the
Accountable Healthcare HRSNs screening tool [[83]].
Data standards play a key role in precision prevention. First, they promote the collection
of individual-level data to facilitate clinical and administrative decision-making
and quality improvement. This highlights a significant cultural shift in healthcare
providers incorporating the patient's voice in the information they document and use
in care delivery. Second, standards facilitate the sharing of this information electronically
across disparate systems of care, social services, and research. We begin to move
from a unidirectional clinical approach to a bi-directional and multi-directional
electronic exchange. Third, they facilitate payment for social risk and social needs
data collection and intervention activities. Ultimately, it is about collecting the
data once and being able to reuse it for upstream (e.g., payment model design) and downstream activities (e.g., claim reimbursement and resource utilization). For example, payment models can address
and evaluate SDOH in various ways, including incorporating screenings and referrals,
social risk adjustment, direct social services funding, community convening, and health
equity statements. The utility of the Gravity Project standards, then, is clearly
not limited to the clinical context in which they are used, but also useful for the
broader goal of precision prevention.
5. Data Use for Precision Prevention
5. Data Use for Precision Prevention
Once data relevant to precision prevention can be captured, stored, and exchanged,
it can be leveraged in various contexts to facilitate addressing those needs, thus
contributing to the prevention or reduction of illness.
5.1. Clinical Setting
Identifying unmet social needs in a clinical setting can facilitate connecting people
to resources that can facilitate prevention, but it does add another layer of complexity
[[21]]. Giving information about resources does not always translate into the person receiving
resources. In addition, it is necessary to use trauma informed practices to meet the
patient where they are with a need.
Closed-loop referral systems (CLRS) like Findhelp and Unite Us provide extensive resource
directories within their technology platforms to connect people with needs to resources
to address those needs.[[84]] Using a combination of geolocation and CBOs specific requirements for service,
these tools match people to the best options for their needs and provide a mechanism
to make a referral to these services. The goal of these platforms is to provide a
“no wrong door” closed-loop referral where a social needs referral can not only be
made but can be tracked from when it is placed to when the resource is provided and
the need addressed.
A systematic review by Pourat et al. [[85]] concluded that while significant progress has been made in collecting SDOH-related
data in EHRs, more effort is needed both to take action based on that data and to
evaluate outcomes following intervention. Iott et al. [[86]] found relatively low rates of engagement with SDOH data collected. Clinical locations
with dedicated support staff, such as a population health nurse trained in social
resource allocation, have been found to be more successful in placing initial referrals
[[21]]. Even with placing of referrals, there can still be multiple points of failure
to a successful connection [[87]].
5.2. Clinical and Non-clinical Data Linkages
Linking clinical and non-clinical data can also inform tailored and focused resources
to impacted communities and individuals to prevent or reduce illness and improve health.
For example, the Environmental Justice Index [[88]] combines social risk and environmental exposure data with clinical disease burden
for a community. This can inform policies and interventions to lower the burden of
exposure and disease in an impacted community. Linking clinical data from EHRs (e.g., people with diabetes) with non-clinical data (e.g., food insecurity) can help focus care management activities to people with chronic
diseases.
We can combine several sources of non-clinical data to identify communities with a
higher burden of risk factors, for example the SVI, to focus resources for particular
communities. We can further drive action to higher risk populations by linking data
sets. For example, by linking data sets in public programs (e.g., Medicaid, WIC, SNAP), state agencies can identify who may be eligible for a public
assistance program, but not enrolled [[89]]. This can inform outreach activities. Further linking to data indicative of food
insecurity (e.g., screening data or data relating to referrals to food resources) can prioritize outreach
to the highest risk individuals. In addition, linking clinical and non-clinical data
sets can inform deployment of resources. For example, robust and granular data on
vaccination status by race, ethnicity, and geography combined with SVI of communities
can inform strategic deployment of vaccine resources to populations with lower vaccination
rates in communities of higher social vulnerability, and did exactly that in the context
of COVID-19 [[90]].
5.3. Population Level
SDOH data can be used to apply targeted precision prevention to different risk tiers
in the population [[23]]. At the population level, data can be utilized to identify priorities for population
health improvement and track progress in improvement through initiatives such as Healthy
People 2030 for the nation [[91],[92]] and through state adaptations, such as Healthy North Carolina 2030 [[93]]. These full population measures contain both clinical metrics, for example early
prenatal care, and non-clinical metrics such as access to healthy foods. Higher risk
populations can be identified by disaggregating full population data to identify disparities
in risk factors, for example people living in poverty. Pre-natal care services and food programs
can then be promoted among high-risk communities. Finally, tailored interventions
can be targeted for the segment of the population experiencing undesirable outcomes, for example populations with higher rates of infant mortality
or communities located within food deserts [[23]].
6. Data Governance Considerations
6. Data Governance Considerations
Some of the data described above, for example self-administered surveys, mHealth data
and social media data, are non-clinical data and are collected directly from the subject,
in many cases outside of the clinical settings. This has implications related to data
governance, including both data quality and privacy issues. While privacy rules around
clinical data are well established under Health Insurance Portability and Accountability
Act (HIPAA), significant ambiguity remains regarding the collection and re-use of
data that does not originate through a HIPAA “covered entity”, i.e., a healthcare provider or payer.
Though it is commonly recognized that clinical data are far from pristine, non-clinical
data introduces a whole new realm of data quality issues. These issues apply to data
that are actively self-reported through surveys or apps, passively collected through
mobile activity trackers, or some combination of these approaches for social media—
user-supplied in the moment and aggregated over time by social media companies as
an effective longitudinal record of activities and mindset. These methods introduce
multiple potential sources of bias including who is providing the data (often affluent
white people who can afford technological gadgets) [[94]], when they are doing so (even for passive collection methods, an activity tracker
may be removed), how accurately the data may be collected (e.g., sensors that are
dependent on skin tone) [[95]], and how many impressions a given post might have (depending on the algorithm used
by a social media company) [[96]].
User perceptions of data privacy and concerns related to confidentiality vary widely,
both across nations and individuals [[97]]. One major concern is the ability to re-identify data that had previously been
considered de-identified [[97]]. Another major apprehension concerns the security and privacy of their health-related
data, mainly due to ambiguity surrounding the types of data collected and stored and
uncertainties about who can access this data [[98]]. Interestingly, a small study found that a group of older (>45) users appear more
concerned about use of data to improve personalization of ads than they are concerned
about misuse of personal health data [[98]]. Users of mHealth apps dealing with stigmatization, discrimination, sexually transmitted
diseases, and mental illness, among other issues, express heightened security concerns
[[98]]. The landscape has been further complicated in the post-Dobbs era, with significant
concerns regarding law enforcement's use of data from applications tracking menstrual
cycles [[99],[100]]. mHealth researchers and developers must prioritize user privacy and confidentiality
to address these concerns effectively. Additionally, the legal landscape must catch
up to where technology is going. To this end, Galvin & DeMuro and others have proposed
the concept of a “healthcare fiduciary” [[97]]. Unfortunately, even where laws exist, they are often not followed [[97],[100]].
7. Conclusions
Significant progress has been made in the field of informatics for precision prevention.
This advancement has been catalyzed by numerous incentives including national policies,
payment models, program reforms, and quality reporting. Critical developments include
the work from the Gravity Project and open value sets, maturation of the USCDI to
include SDOH-related domains, increased use of social media and mHealth among specific
communities and populations, and the adoption of innovative closed loop referral systems
to address social needs. Still, many challenges and opportunities remain in this space.
Certified health IT developers will need to update their systems to support the USCDI
Version 3 data classes and thereby adopt coded data elements for the four SDOH data
classes no later than January 2026. Given this directive, public and private entities
need to further collaborate on education and outreach activities that promote the
collection and use of structured SDOH data and the evaluation of SDOH interventions.
Currently, awareness of how to integrate clinical and social care data is nascent
across most health systems and community-based organizations. Similarly, the limited
use of FHIR-based open APIs by local networks for directed exchange (push) and national
networks for query-based exchange creates ongoing barriers for interoperability across
clinical and social services systems.
There is limited evidence on what SDOH interventions benefit the right population
at the right time. Even though nationally recognized data standards exist to represent
SDOH interventions across 19 domains, more work is needed to identify what interventions
work best for specific populations and groups by race, ethnicity, and language. Relatedly,
current funding models do not support ongoing monitoring of the effectiveness of tailored
SDOH interventions by population and group. These challenges point to an opportunity
to apply the Learning Health System (LHS) model in the context of precision prevention.
LHSs are designed to use data-driven learning processes to continuously and iteratively
research and improve clinical practice. They leverage the data collected through clinical
care to inform research questions and, ultimately, clinical guidelines, which then
generate more clinical data. The LHS life cycle is an ideal format to extend and refine
the effectiveness of precision prevention health interventions. By tailoring the needs
of specific individuals or communities based on their unique risk factor profiles,
targeted population-level prevention strategies provide more specific and tangibly
effective resources/interventions.
While increased use of mHealth and social media in this space is promising, particularly
for traditionally marginalized populations, there is significant uncertainty around
the storage, exchange, and use of data that is derived from non-HIPAA covered entities
from both a legal and ethical perspective. These topics must be addressed, with significant
input from patient and consumer advocates.